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IN THE NAVAL ELECTRONICS PROBLEM* 
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INTRODUCTION 

The problem of procuring and allocating electronic equipment for Naval ships is an 
extremely vital one, involving an annual budget of fifty to one hundred million dollars, and, 
more important, the safe and efficient operation of the entire Navy. Haphazard or inefficient 
allocation plans are therefore expensive and dangerous, and the importance of a mathematical 
systems approach is obvious. On the other hand, the problem is complex and involved; there 
are hundreds of different ship classes, many available equipments, and all kinds of complex 
relationships within and between the ships and the equipments. The problem has been fully 
described and discussed by J. W. Smith (see [1]). As a result of the efforts of Smith and 
others, including the authors, a systematic analytical technique for obtaining acceptable 
allocation plans has been developed; a brief description of the method is given in [1]. The 
technique may also be applicable to the procurement problem. 

Computationaliy, the technique uses the standard linear programming model for the 
assignment problem; see [2]. The crux of the problem is therefore the evaluation of the coef- 
ficients in the linear program; i.e., finding values for the assignment of a given equipment to a 
given ship. We will here concentrate on this aspect of the problem. A brief description of the 
method has already been given in [3]. 

The technique used here has been described and discussed in [2] from the general, 
theoretical viewpoint, and for a greatly simplified version of the Naval Electronics Problem. 
The purpose of this paper is to present the detailed application of that technique to the Naval 
Electronics Problem in all its actual, real-life complexity. This paper is intended primarily, 
not for those whose main interest lies in the general, theoretical aspects of the allocation prob- 
lem, but rather for those who are concerned with Naval Electronics, and for those who are 
interested in the working out of a technique such as this in a particular case, with all the messy 
detail with which particular cases tend to be burdened. 





*Manuscript received 17 March 1958. 
The research described in this paper has extended over several years, during which the authors 
have been affiliated with various institutions. These include Princeton University, Bell Tele- 
phone Laboratories, Inc., The University of Wisconsin, The Hebrew University, and the National 
Bureau of Standards. 
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This paper discusses one aspect of the Naval Electronics Problem; the whole problem 
has been worked on by many people in many places and at many times. Among those who con- 
tributed significantly to the research described here are J. E. I. Heller, John P. Mayberry, 
Norman Shapiro, and George Suzuki. The most credit, though, must go to Jack W. Smith, 
whose ideas originated the project, and whose leadership kept it going throughout. 


DESCRIPTION OF THE PROBLEM 

It is useful to view the list of possible assignments as being arranged in tabular form. 
The rows of the table represent electronic equipments, the columns represent "positions," 

i.e., places where the equipments will be put. Thus the individual squares of the table represent 
assignments of electronic equipments to positions; we wish to fill in the table with the values of 
the various assignments represented by the squares. 

We will make use of the rather commonplace idea that the parts and the positions may be 
grouped into equivalence classes in considering an allocation problem. The parts—in our prob- 
lem the electronic equipments—will be grouped into models; two equipments are said to be of 
the same model if they are built to the same specifications and have identical performance 
characteristics. Thus if we are given a position and two equipments of the same model, then 
we have no preference as between assigning the one or the other equipment to the position. 

For convenience, we will define the void model, 0, to consist of no equipment at all. 

Because of the complex interrelationships between positions, the grouping of the 
positions is not so quickly achieved. The basic notions are those of priority, state, and good- 
ness, Priority refers to the intrinsic importance of the mission which a position is to fulfill. 
For example, a position on an aircraft carrier which is meant to contain an air search radar 
equipment has higher priority than a similar position on a destroyer. Similarly, a sonar 
position on an aircraft carrier may have lower priority than an air search radar position on 
the same ship. State refers to the model that is already installed in the position in question. 
Goodness refers, roughly speaking, to the suitability of the various models to the position in 
question. Thus a large model might be more suitable to an aircraft carrier than to a destroyer, 
or a certain model of a radar might be more suitable to a position requiring medium- or short- 
range equipment than to one requiring long-range equipment. 

If we were to go according to only one of these factors, the solution to our problem 
would be simple enough. Thus if we were to decide to consider priority only, then we would 
fill the position of highest priority firt,, the next position on the priority list next, and so on. 

If we were to consider state only, we would first fill all the positions with nothing installed, 
then all the positions with the most inferior equipment installed, and so on. But matters aren't 
so simple. High-priority positions may have fairly good equipments already installed, there 
may be equipments that are especially suitable only for low priority positions, etc. All kinds 
of combinations of the three basic factors can and do occur, in varying proportions, in real- 
life problems. It is precisely the interrelationships between the factors of priority, state, and 
goodness that complicate the solution to the naval electronics problem. Since this is the crux 
of the problem, we must ask the reader to bear with us while we examine more closely these 
concepts that we have introduced. 

Priority is determined by a directive, called the MIP (Material Improvement Plan), that 
is issued by CNO (the Chief of Naval Operations). Items in the MIP are pairs consisting of a 
ship class (e.g., CVA aircraft carrier, DMS destroyer) and a requirement (e.g., sonar, 
medium-range surface search radar). The MIP ranks these items priority-wise. Twopositions 
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that correspond to the same item have essentially similar missions, and so are considered to 
have the same priority. The relative priority of other positions is given by the relative 
priority of the corresponding MIP items. 

In practice, the MIP can be divided into a relatively small number of classes. Within 
each class, CNO has determined which item has priority; but this is always a much finer 
distinction than the large distinction made between the classes. The set of all positions 
corresponding to MIP items that belong to one of these large classes is called a priority group. 
So much for priority. 

The state of a position is determined by which model of equipment is already installed 
in the position (before the contemplated allocation). Nothing more need be said about the state 
at the present time. The set of all positions of a given state is called a state group. 

The concept of goodness is in reality associated with pairs consisting of a model and a 
position, rather than just with positions. Thus the corresponding concept for a position alone 
is that of a goodness vector, i.e., the set of all goodnesses obtained when we fix the position 
and let the model vary over all possible models. Two positions have the same goodness vector 
if each model is equally suitable to both positions. The set of all positions whose goodness 
vector is the same as that of a given position is called a goodness group. 

Goodness groups are determined in the following way: Insofar as the position is con- 
cerned, suitability of a model to a position is determined by the kind of ship on which the 
position is located, and the requirement which it is to fulfill, When we say "kind of ship"' we 
do not mean ship-class, which is a rather specific concept, but a much broader concept such as 
“space-critical ship" or "weight-critical ship" or their negatives. Whereas there may be fifty 
or more ship classes involved in a problem, there are usually no more than about four or five 











"kinds" of ships. "Requirement" is used here in the same way it was used before; examples of 
a requirement are sonar and medium-range surface search radar. A goodness group may now 
be defined as the set of all positions fulfilling the same requirement and located on ships of the 
same "kind," 


Summing up, the positions have been grouped in three independent ways: into priority 
groups, into state groups, and into goodness groups. It is important to note that the classifica- 
tion of a position by one of these methods in general has no effect on its classification by 
another method. 

We emphasize that the naval electronics problem need not be regarded as one huge 
allocation problem, but can be broken up into a number of smaller independent problems. 

There are usually many different models capable of filling a given electronic requirement. 
Conversely, a given model is often capable of filling more than one requirement. Thus, when 
considering the allocation problem for a given requirement, it is usually necessary to consider 
some models that are suitable for other requirements as well; these new requirements will 

then have to be considered at the same time as the original requirement. These in turn may 

be capable of being filled by other models, which necessitates considering still more require- 
ments. The process eventually terminates; in fact, electronic equipment in use by the Navy is 
sufficiently specific so that usually at most three or four requirements can be strung together 

in the manner indicated above. These three or four requirements are, in fact, usually closely 
related anyway, so that from the administrative point of view, it is also convenient to consider 
them at once. The entire allocation problem for all electronic requirements can be conveniently 
Separated into a number of smaller problems, each dealing with a "string" of requirements of 
the type described, together with all models appropriate for any of the requirements in the string. 
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In the rest of this paper, attention will be focussed on a single problem dealing with only one 
string of requirements. 

In Figure 1 we give a small example of what our structure might look like in a specific 
case; but Figure 1 is only meant to indicate some of the possible complex interrelationships, 
and should not be interpreted as giving the structure of a typical problem. Although a typical 
problem need not exhibit more complexity, it is usually a good deal larger. Following is a list 
giving the approximate order of magnitude for the number of groups and elements of each kind 
that might appear in a typical realistic allocation problem: 


Number of available equipments: 500 
Number of models: 8 
Number of positions: 800 
Number of MIP items: 40 
Number of priority groups: 5 
Number of state groups: 5 
Number of goodness groups: 5 


Our aim is to find the value—to be thought of intuitively as a "military worth"—of 
assigning a given equipment to a given position. We will assume that this value depends only 
on the model of the equipment, and on the priority,” state, and goodness groups of the position. 
Thus we are interested in determining all the values v(M;, P, G, Mj), where this expression 
stands for the value of assigning an equipment of Model M; to a position in priority group P 
and goodness group G which has an equipment of model M; already installed. 


Position No. 10 111 
MIP Item No. $s $ 
Priority Group No. 3 3 
State Group No. . 
Goodness Group No. 2 2 
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Figure 1 - An example of an electronics allocation problem. The electronic equipments that 
are available are divided among various models, as shown in the figure. The positions that are 
requesting equipments can be classified into groups in various different ways, as shown in the 
figure and explained in the text. 





*See "Priority Groups and the MIP." 
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BRIEF REVIEW OF [2] 

Let us review briefly the ideas introduced in [2], and the conclusions reached there. 
The central idea of [2] is that significant quantitative information about the values we are 
seeking can be obtained from qualitative decisions made by Naval Personnel. As an example, 
let us consider the MIP. 

The MIP is provided by CNO for guidance in making up procurement and allocation 
programs. On an intuitive basis, we may conclude that "all other factors being equal," the 
assignment of an equipment to a position higher on the MIP should have higher value than a 
similar assignment to a position lower on the MIP. In accordance with the terminology 
previously introduced, this can be formalized as follows: 


If P, and Po are priority groups such that 
P, is higher on the MIP than Po, then for 
every model M and goodness group G, we have 


(1) v(M, P,, G, 0) > v(M, Pp, G, 0) 


with strict inequality holding for nonvoid M. The MIP can thus be regarded as a set of 
inequalities involving the values we are seeking. As such, it does already give us quantitative 
information regarding the values. On the other hand, it is clear that much more information is 
needed in order even to approximate numerical values. 

Basically, the MIP consists of a set of decisions made by CNO in "small" or "token" 
allocation problems. The inequality (1) is tantamount to a decision by CNO that he would 
rather assign an equipment of M to a position in P, and G that has nothing installed than to a 
position in Po and G that has nothing installed (unless M is void). The "token" allocation 
decisions that constitute the MIP are of a very special kind—the positions involved in a given 
decision are always contained in the same goodness group and state group, and the equipments 
involved are of the same model. The inequalities that these decisions yield are not sufficient 
to determine our values numerically, even approximately. It is natural to try to extend the 
scope of the "token" decisions. Additional decisions will yield additional inequalities, and by 
means of these we will be able numerically to approximate the assignment values. 

Of course, it does not necessarily have to be CNO that reaches the decisions on these 
"token" problems. We will postulate the existence of a Naval Authority (called "the Board") 
authorized to exercise judgment and arrive at decisions on allocation problems. In practice, 
the Board might consist of either one or several members. 

Let us now turn to the conclusions reached in [2]. Starting out from a situation in which 
positions were distinguished only by their Priority and their State, we reached the conclusion 
that there are functions q and g such that 


(2) v(M;, P, M;) = a(P) (g(Mj) - g(M;)), 


where v(M,, P, M)) is the value of assigning an equipment of model M; toa position in 
priority group P which has an equipment of model M; already installed; q(P) is the priority 
rating of P, and g(M) is the goodness rating of M. We also showed that the goodness and 
priority-ratings could be fairly closely determined numerically using only the qualitative 
decisions of the Board in token allocation problems. Finally, the goodness rating g is such 
that if M,> amet M, ranges over all models, then 
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(3) max g(M,) = 1. 


PRIORITY GROUPS AND THE MIP 

The value vector associated with a given position b is the set of values obtained by 
assigning equipments of different models to b. A similar definition can be made for any 
function that associates a number with each pair consisting of a model and a position. A value 
group is the set of positions having a given value vector. A value group is the intersection of a 
priority group, a state group, and a goodness group. The priority group of a position depends 
only on the priority "class" of the MIP item to which the position corresponds, not on the MIP 
item itself. The formula for value introduced previously involves the tacit assumption that the 
distinction in priority between MIP items in the same priority group is small as compared to 
the distinction between priority groups. In the typical case presented, it cuts the number of 
values that must be ascertained from 1000 to 400; actually, as will be seen later, the saving is 
much greater than this. 

Somehow, though, the distinction between members of the same priority class should be 
taken into account. This will be done as follows: Define a subvalue group to be the set of 
positions in a value group that correspond to a given MIP item. If we solve the linear program 
under the above assumption (i.e., that the value vector is constant throughout a value group 
rather than just throughout a subvalue group), we will in general obtain some "split" value 
groups,* i.e., value groups whose positions are assigned equipments of various models 
(including possibly voids), instead of equipments of just one model. Since for a given model, 
all positions of a given value group V have the same value, the question then arises, which 
positions should be assigned which models. This question is answered by reference to the 
distinctions between activities within the priority group of which V is a subset: We simply 
assign the better models to the positions that are higher in the MIP list. If any subvalue group 
is then still split by the allocation, then the choice of assignments within the subvalue group is 
arbitrary. The process is entirely in accord with the intuitive meaning of the MIP list: the 
more important activities get the better equipments. (If we could, we would apply the method 
throughout the whole problem: the only trouble is that to define "better equipments" uniquely, 
we must have the value of a model constant throughout the positions under consideration, and 
of course that is not so unless we restrict ourselves to a single value group.) 


THE VALUE FUNCTION 

The difference between the mathematical structure of the problem presented in [2] and 
that presented here is that here the positions are distinguished by their goodness group as well 
as by their priority and state groups. However, we can apply the results obtained in [2] if we 
restrict our attention to one goodness group. Put differently, in using (2), we must take the 
goodness group G of a position into account, and allow the priority and goodness ratings to 
vary with goodness group as well as with priority group and model. We thus obtain 


(4) v(M;, P, G, Mj) ” q(P, G) (g(M;, G) - g(M ? G)) 


and 


(5) max g(M,,G) = 1, 
1 





*The number of split value groups never exceeds the number of models. 
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where M;, P, and M, are as before and G is the goodness group of the position to which the 
assignment is being made. 

For reasons that will become apparent later, let us henceforth call the g(M,, G) relative 
goodnesses rather than just goodnesses. The relative goodness vector g(G) = (g (M,, _ eae 
g(M,, G)) can be determined by the method given in section 5 of [2]. The q(P, G) may also be 
determined as in [2]. However, the evaluation of the q(P, G), unlike that of the relative good- 
ness ratings, can be considerably simplified. 

To determine the q(P, G) in accordance with the methods of section 6 of [2], we would 
ask the Board the following type of question: Given two positions, one in priority group P; 
with an equipment of model Mj already installed, the other in priority group P, with an 
equipment of model M; already installed, both in the same goodness group G, and an available 
model M better* in G than either M; or M;, to which position would you assign M? 

In our situation, we are presented with the possibility of an alternative type of question: 
Given two positions, one in goodness group G; with an equipment of model M; already installed, 
the other in goodness group G; with an equipment of model M, already installed, both in the 
3ame priority group P, and an available model M better in G; than M; and better in G; than 
Mj, to which position would you assign M? 

To distinguish these two types of questions, we will call the former a priority question, 
and the latter a top anchor question, for reasons which will soon be clear. According as to how 
atop anchor question is answered, we can decide which of 


(6) 








q(P, G)) g(M, G;) ad g(M;, G;) 


holds. Let us fix P, G;, and G.. For each M, M;, and M., the right side of (6) can be 
numerically evaluated (approximately) by the methods of section 5 of [2]. Thus by varying 
M, M;, and M., we will obtain a number of numerical estimates for the (fixed) right side of 
(6), both from above and from below. Taken together, these estimates approximately determine 
the ratio q(P, G,)/a (P, G)). The process is similar to that used for priority questions in 
section 6 of [2]. 

Let p(P) be the maximum of the q(P, G) over all goodness groups G. Define 


(7) t(P, G) = q(P, G)/p(P) . 

If the maximum of the q(P, G) is taken on for G = G(P) then we deduce from (7) that 

(8) t(P,G(P))=1. 

Clearly 

(9) a(P, G,)/q(P, G;) = t(P, G,)/t(P, G) . 

We have already seen how the left side of (9) may be determined. Hence we can also determine 


the right side of (9). (8) gives us the value of one of the t(P, G). Since we now know the ratios, 
we can determine the values of all the t(P, G) from the single value given by (8). 





*Formally, "having higher relative goodness." 
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Our determination of the values of the t(P, G) was based entirely on the answers to 
top-anchor questions, and on their interpretation in the form (6). Since the right side of (6) 
does not involve P in any way, the only way in which the values of the t(P, G) could depend on 
P is if the answers to the top anchor questions depended on P. As a matter of actual* fact, 
though, the answers to the top-anchor questions are independent of P. We conclude that t(P, G) 
is independent of P, so that we may write 


(10) t(P, G) = t(G). 
Combining (7) with (10) we obtain 
(11) a(P, G) = p(P) t(G) ; 


p(P) is called the priority rating of P; t(G) is called the top anchor of G. 
From (11) and (4) we deduce 





(12) v(M;, P, G, M;) = p(P) t (G) (g(M;, G) - g(M;, G) ) . 


The saving effected by (11) is considerable. In our typical example, (11) allows us to 
determine five t(G) and five p(P)—a total of ten parameters—instead of about twenty-five 
q(P, G) we might have had to determine in the absence of such a formula. 

Intuitively, (11) and (12) have the following meaning: The g(M, G) are supposed to be a 
measure of goodness, i.e., a measure of the suitability of a piece of equipment to its position. 
We have arbitrarily set the highest g(M, G) in a given G to be equal to 1. If this were to bea 
measure of absolute goodness, such a measure would be manifestly unrealistic. The reader 
can convince himself of this by considering what would happen if a new model were to be 
developed suitable for only one of the goodness groups, say G. The new model would have the 
highest possible goodness in G, namely 1; the old models would have goodnesses in G cor- 
respondingly smaller than 1; whereas in goodness groups other than G, the old models would 
retain their former goodnesses. Obviously either all goodnesses should be depressed with the 
introduction of the new model, or else the old goodnesses should be left undisturbed and the new 
model be given a goodness higher than 1; both are contrary to the stipulation that the highest 
goodness in a given goodness group be equal to 1. Obviously, therefore, the g(M, G) should be 
considered only as relative goodnesses, i.e., they should be considered as giving valid com- 
parisons within a goodness group only. To obtain comparisons valid between the goodness 
groups, each of the relative goodness vectors should be multiplied by a scalar factor, which 
compares the state of development in one goodness group with that in the others. Since this 
factor anchors the top goodness in any goodness group, it is called the top anchor. The absolute 
goodness in G of a model M is given by the product t(G) g(M, G). An absolute improvement 
is the difference between two absolute goodnesses; and as in [2], to obtain the value of an 
assignment, we must multiply the absolute improvement it accomplishes by the priority rating 
of the position which it fills. This is formula (12). 








*I.e., experimental. 
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We remark that (12) is not only a "reasonable" formula, but is actually the unique 
correct formula, assuming that linear programming is at all applicable and that there is a 
formula for value in terms of the priority, relative goodness, and top anchor ratings. More 
precisely, we are making the following assumptions: 

1. The problem may be set up as a linear program, 

2. The value of assigning an equipment to a position depends only on the model of the 
equipment and the priority, state, and goodness groups* of the position. 

3. Absolute goodness is independent of Priority Group. 

Precise mathematical statements and justification of all but one of these assumptions 
may be found in [2]. The additional assumption is embodied in (10), and is justified by the 
experimental conclusion that the answers to the top-anchor questions do not depend on P. 





NUMERICAL DETERMINATION OF TOP ANCHORS AND PRIORITY RATINGS 

To start with, we wish to assure the reader that neither the priority questions nor the 
top anchor questions will in practice look as forbidding as does the general form given in the 
previous section. The questions with which the board is confronted will each involve two 
specific ship-classes, with specific models installed and a specific model available for 
installation. These models and ship classes will be chosen by the questioner to conform with 
the specified form, but the board need not and should not concern itself with priority groups, 
goodness groups, and so on; it should concentrate on answering the specific questions put to it. 

Next, we should point out that there are some difficulties and pitfalls involved in the 
determination of the top anchors and priority ratings. As an example, let us consider the case 
of the top anchor. The process described in the previous section for obtaining the ratios 
between top anchors is in practice at best a rather rough one. Even if we were working with 
precise relative goodness ratings, it would be impossible to obtain precise top anchors. That 
is because within a certain area, usually something like 5%-10% of the top anchor in question, 
the decisions are difficult to make, come out as toss-ups, or even involve numerical con- 
tradictions of a minor sort. Thus we might get answers implying that a certain ratio is at the 
same time >.75 and <.73. This means that it is impossible to determine these ratios with 
greater precision. It is not only impossible, it is meaningless; to the extent that these 
parameters "exist" at all, they exist only in this loose, imprecise way. The imprecision in 
the top anchors is compounded by the imprecision in the relative goodness ratings, on which 
the determination of the top anchors is based. We may say, though, that given a set of relative 
goodness ratings, it is possible to determine top anchors with something between one and two 
significant figures. 

One precaution we can take in order to minimize, to the extent possible, the damage 
caused by imprecise relative goodness ratings is to stay away from ratios on the right side of 
(6) with small denominators. That is because imprecisions in a small denominator are 



















*Although we have used the vague idea of "'suitability''to give an intuitive meaning for goodness, 
the formal definition of goodness group involves only the well-defined (in any particular prob- 
lem) notions of ''kind"' of ship and requirement. Therefore any consideration affecting assign- 
ment value that depends only on these two notions can be formally incorporated into the good- 
ness concept. An example is the "Utilization Factor'' mentioned in [1]. Even if it is assumed 
that this is relevant, it should not appear explicitly in our formulae, because it depends only 
on the goodness group, and is therefore incorporated in the goodness ratings. Other examples 

might be such factors as repairability and obsolescence. Like goodness, they depend only on 
the requirement and "kind" of ship with which the position is associated (and of course on the 

model of the equipment to be installed). , 
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tremendously blown up in the quotient. In general, some components of the relative goodness 
vector are more "reliable" than others, as we shall see later; in asking top anchor questions, 
some attempt should be made to rely more on the more reliable ratios. To facilitate this, an 
"imprecision interval" should be calculated for each ratio and placed on the list of ratios that 
we are using to determine the ratio of our two top anchors. The questioner should then try to 
stick to questions with as small an interval of imprecision as possible. 

Theoretically, to obtain the top anchors, we pick an arbitrary priority group P and 
then determine all ratios of the form q(P, G,)/a(P, G;), i.e., all ratios of the form t(G,)/t (G)). 
From this listing it will be apparent which top anchor is the largest; we set this top anchor = 1, 
and deduce values for the other top anchors. Unfortunately, it turns out that this can usually not 
be done, since not all the priority groups intersect all the goodness groups. In fact, very few, 
if any, do. For the determination of any one ratio, say t(G,)/t (G)), we must find a priority 
group P intersecting both G; and G;, so that we can select activities in G; and G; respec- 
tively that can be compared by means of top anchor questions. Sometimes there is no such 
priority group. We must then find a "bridging" chain of priority groups "1, ones .™ and 
goodness groups i *, ™",? tees > where i, =i and in = J, Such that each :, intersects 
both .. and 4. Thus we will be able to determine each of the ratios on \ (G; ), and 
by multiplying them obtain the ratio we seek. 

In practice, we choose one goodness group G and try to determine all the ratios 
t (G,)/t (G). To the extent possible, no bridging should be used, and all ratios determined 
independently. That is because the "snowballing" of imprecisions that occurs when bridging 
is used becomes rapidly disastrous. Care should be taken in the choice of G to minimize the 
error that will be incurred due to bridging; if possible, G should be chosen so that bridging is 
entirely avoided. 


Everything we have said here about top anchors applies with equal force to priority 
ratings, which have an almost exactly symmetrical relation with top anchors. 


TIME PHASING 


In discussing the mathematical structure of the problem earlier, we completely ignored 
its time aspect. In point of fact, equipment is becoming available more or less continuously, 
and ships are coming in for overhaul more or less continuously. This imposes several 
additional tasks on our technique for arriving at allocation plans. We will discuss these under 
the headings of primary, secondary, and tertiary time phasing. 

Primary time phasing is simply the job of scheduling, i.e., fixing things so that equip- 
ment is available by the time the ship to which it is assigned comes into port for overhaul. 

Secondary time phasing is the problem of taking into account idle warehouse time of 
available equipment waiting to be installed, and to the extent possible minimizing such time. 
For instance, it may happen that assignment of equipment a to position 1 has a slightly higher 
value than assignment to position 2. As a result, equipment a is allocated to position 1, while 
a considerably inferior model is allocated to position 2. Now it happens that equipment a is 
available in January, the ship on which position 2 is located is coming in for overhaul in 
February, and the ship on which position 1 is located is not coming in until the following 
December. Since the value of assigning equipment a to position 1 is only slightly higher than 
assigning it to position 2, it seems clear that we shouldn't let the equipment lie around for 
almost a year just to wait for the ship on which position 1 is located. The example is 
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admittedly rather extreme, as usually ships of all kinds are coming in fairly often. However, 
it does serve to illustrate the concept of secondary time phasing. 

Tertiary time phasing is the long-range problem. It involves taking into account 
allocations that will be made at a time in the future which is not being taken care of in the 
present allocation plan. For example, let us say that we are making up a one-year allocation 
plan. We may have reason to believe that a new, superior model of equipment will become 
available for allocation next year. That means that we might want to assign more of the 
currently good models to the less important positions, so as to leave room on the more 
important positions for the superior model that may be available in the future. 

One way to handle the primary and secondary time-phasing problems is first to use 
the method described in the foregoing section to obtain an un-time-phased allocation plan, then 
to do the best one can "by hand" to time-phase the resulting plan. In practice, such a scheme 
works fairly well, because both ships and models of all kinds are coming in fairly frequently. 
Also, the ships stay in the yards for a period ranging from six weeks to several months, thus 
giving the time-phaser fairly wide latitude. In some instances, it may be necessary to make 
comparatively small changes in the allocation plan in order to make it conform to the require- 
ments of primary time-phasing. Actually, in the method now in use inthe Bureau of Ships, an 
un-time-phased allocation plan is first made up "by hand," and it is only afterwards time- 
phased, also "by hand." The method described in the foregoing sections may then be said to 
replace the first, but not the second of these processes. 

As of now, secondary time-phasing is considered of secondary importance. A blanket 
rule is made that all equipment becoming available during any given fiscal year will be assigned 
during that year, but within that restriction not much emphasis is placed on the problem. On 
the average, things work out satisfactorily. Of course, the great difficulty in this field is the 
lack of criteria, the difficulty in comparing the relative importance of assigning an equipment 
at once and assigning it to an important position. 

An alternative method has been developed for handling the primary and secondary time 
phasing problems—a systematic method that employs the principles of [2] and of the foregoing 
sections. However, this method does not enjoy the status of the un-time-phased method 
described in the foregoing sections in the sense that it has been developed only recently and 
has yet to be applied to and tested on real-life allocation problems. It will be described in 
detail in a subsequent paper. 

Tertiary time-phasing does not really constitute a problem at all. Allocation plans are 
currently made as far into the future as it is possible to say, even approximately, what will be 
available and what will be needed. Current practice in the Bureau of Ships is to prepare an 
allocation plan five years into the future! (This allocation plan is revised every year.) Beyond 
that, availability of equipment and other factors affecting allocation can only be taken into 
account in a generalized way. These factors will therefore automatically be considered in the 
token allocation decisions arrived at by the Board. Thus in a token allocation decision, the 
Board might very well decide to assign an equipment to the otherwise less important position, 
because of the tertiary time-phasing kind of reasoning. It might even reverse some pairs in 
what would otherwise be the "expected" goodness order for a given category. In any case, to 
the extent that the Board feels that the "tertiary" kind of consideration is significant, its feeling 
will be reflected in its decision and hence in the values. 
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MULTIPLE ALLOWANCES 

A ship S is said to have a multiple allowance in a certain requirement” if the number 
of positions on § that fill that requirement is greater than 1. Allowances for major elec- 
tronic equipment on large ships sometimes run as high as 7. Since all the positions filling a 
given requirement on board a given ship are associated with the same MIP item, they all have 
the same priority. This may lead to the anomalous situation in which a large ship gets its 
seventh equipment of a certain kind before a slightly less important ship gets its first. 

One way to deal with this problem is formally to consider each of the positions in a 
given multiple allowance as filling a different requirement. (This is actually intuitively 
justifiable; the first surface search radar on a ship obviously serves a different function from 
the second, which can only supplement the first.) Each of the positions would then occupy a 
different position on the MIP. In this ranking, the position with the best equipment already 
installed should rank highest, with the next best installed second highest, and so on. 


PROCUREMENT 

Up to now, we have been assuming that the number of equipments of each model avail- 
able at any one time is given us. Actually, of course, in the long run it is only the money that 
is given us; the problem of how many equipments of each model to buy with the available money 
must also be solved by the Bureau of Ships. This is the procurement problem. 

As might be expected, procurement and allocation are in fact intimately related. 
Procurement effectiveness can only be measured in terms of the allocation doctrine that will 
be applied following the procurement; conversely, inefficient procurement will greatly hamper 
any effort to obtain an allocation plan of high value. It would seem that the two problems are 
inseparable, and should be treated as a combined procurement-allocation problem, optimizing 
procurement and allocation simultaneously. 

As a matter of fact, the unified procurement allocation problem can be set up as a linear 
program, using the same objective function as the one that the allocation problem uses, but 
replacing the restraints governing model availability by money restraints. See [3]. The prob- 
lem of finding the coefficients of the objective function is of course the same in this problem as 
in the problem we have treated. 

There has been a certain amount of opposition to the use of the linear programming 
technique for determining procurement. Basically, this opposition is probably due to the fact 
that one of the fundamental reasons for introducing the technique into the field of allocation 
planning is missing in the case of procurement. This is the combinatorial complexity of 
allocation plans. Procurement plans are combinatorially relatively simple; they only involve 
the determination of how many of each of a fairly limited number of models to buy. It may be 
felt, with some degree of justification, that the whole procurement problem is combinatorially 
sufficiently simple as almost to be considered a "token" problem in itself. There is little point 
in then replacing one "token" problem by a set of different "token" problems. Of course, the 
difference between the procurement problem and a true "token" problem is that in the latter, 
we have a clear understanding of the precise implication of our decision insofar as the final 
allocation is concerned, whereas in the former we do not; nevertheless, it remains true that 


the procurement problem in itself does not "combinatorially overwhelm" the deciding officer, 
as does the allocation problem. 








*For the meaning of ''requirement,"' see page 2. 
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Another reason for differentiating between procurement and allocation is that these two 
events do not really occur simultaneously. Electronic equipment must be ordered an average 
of two years before it is delivered; thus procurement decisions must be made two years in 
advance of allocation decisions. A lot can happen in two years. It might not be wise to make 
procurement depend solely on our estimate of what optimal allocation will be in two years. 
Some flexibility should be built into a procurement plan that looks that far into the future, as 
insurance against a change in conditions. On the other hand, how do we measure flexibility? 
Do current methods arrive at procurement plans that are more flexible? At any rate, it can 
be seen that the extension of the technique into the field of procurement planning involves 
conceptual problems that are absent in the case of allocation planning and that have not yet 
been completely settled. 

The procurement-allocation problem we have been considering is only that involving a 
single string of requirements. Congress makes a single appropriation for the entire naval elec- 
tronics field, so even if we make the procurements within a string by "machine," we will still 
have to decide on the distribution of money between the strings by "hand." One way to get 
around this is to consider the entire Naval Electronics procurement-~allocation problem at the 
same time, comparing the values as between different strings of requirements by the use of 
"token" procurement-allocation questions, e.g.: If you had $100,000 available with which you 

. could either buy two surface-search radar equipments of model so-and-so or three radio 
transmitters of model such-and-such, and if you had these and those positions to fill, which 
equipments would you buy and how would you allocate them? 

We don't know whether the use of such questions is feasible, or whether we can hope to 
obtain meaningful answers to them; experimental work has been performed only on the 
procurement-allocation problem confined to one string of requirements, not on the over-all 
Naval Electronics procurement-allocation problem. As a final remark, we may say that the 
unified procurement-allocation problem for all of Naval Electronics would involve a very large 
matrix indeed, probably too large to go into the internal memory of any existing computer. * 


A BRIEF ACCOUNT OF EXPERIMENTAL WORK 

At the time the authors began work on this project, they were presented with a specific 
allocation problem. Involved were some 8 models of radar repeaters to be distributed among 
some 800 positions divided into 80 MIP items, 6 state groups, and 4 goodness groups. The 
system of bunching MIP items had not yet been invented, so that each MIP item constituted a 
separate priority group. 

Priority ratings were assigned purely arbitrarily (the lowest item on the MIP was 
assigned a priority rating of 60, and each succeeding item had a rating that exceeded the 
previous rating by 1; somewhere along in the middle a gap of about 4 or 5 was left) and the 
wrong formula for combining priority and goodness ratings was used. Even under such con- 
ditions, though, the allocation plan obtained was deented superior to the one that had been 
obtained by hand! This would seem to indicate that a systematic method using the available 





*A simplified computational technique for dealing with procurement problems involving several 
distinct allocation problems has recently come to our attention. By making use of Lagrangian 
multipliers, it allows us to consider each one of the distinct allocation problems separately, 
so that the whole large procurement problem need not be put on the computer at one time. An 
application of the technique is given in [4]. In our case, the "procurement problem" is the 
unified procurement problem for all of Naval Electronics, whereas the ''distinct allocation 
problems"! are those determined by the various strings. 





14 R. J. AUMANN AND J. B. KRUSKAL 


information in even the most cursory fashion yields results that are superior to those obtained 
by the use of haphazard methods. Another advantage of the new method was also noted at that 
time: several "bloopers" or absurdities in the hand solution had been eliminated in the sys- 
tematic linear programming solution, despite the use of arbitrary input and incorrect formulae, 
When the formula that later turned out to be the unique* correct one was adopted, still more 
acceptable solutions were obtained and absurdities eliminated. 

Another problem that was studied, this time from the point of view of procurement as 
well as allocation, involved 6 models of surface-search radar available for distribution among 
700 positions divided into 52 MIP items, 4 state groups, and 5 goodness groups. Work on this 
problem led to the system of grouping MIP items into priority classes and to the systematic 
determination of priority ratings, top anchors, and relative goodnesses as described above. 


NUMERICAL METHODS FOR OBTAINING RELATIVE GOODNESS RATINGS 

An example of the calculation of a relative goodness vector involving three models is 
given in [3]. Often there are more models to work with, and this gives us a more precise 
determination of the relative goodness vector. It is also true that an improvement order con- 
taining toss-up decisions seems not only to yield a relative goodness rating polyhedron of cor- 
respondingly smaller dimensionality, but also to reduce the diameter of the polyhedron. Thus 
a toss-up decision becomes doubly valuable. 

We make three additional remarks. First of all, it sometimes happens that where there 
are n models, more than n independent inequalities arise from the improvement order. In this 
case, we will obtain a polyhedron of possible relative goodness vectors that is not a simplex. 
To obtain the center of gravity of such a polyhedron, we break it up into simplexes. The center 
of gravity of the whole polyhedron is then the average of the centers of gravity of the component 
simplexes, weighted by their volumes. The center of gravity of a component simplex is found as 
illustrated in [3], while the volume is easily determined from the determinant of the vertices. 
In the instances that we have run across, the accompanying calculations have been remarkably 
quick and painless, largely because of the high ‘incidence of zeros in the determinants involved. 
In more complicated cases, a high-speed computing machine could profitably be used. 

Next, we note that for every polyhedron P of possible relative goodness vectors, there 
is a smallest! rectangular parallelopiped with sides parallel to the axes that contains P. From 
the fact that this rectangular parallelopiped is not in general a cube, we may deduce that the 
precision with which our method determines the components of the relative goodness vector 
usually varies from component to component. This is why some components of the relative 
goodness vector are more "reliable" than others, as we noted earlier in this paper. 

Our third remark is that occasionally the Board is only willing to give the beginning of 
an improvement order, but is unwilling to commit itself as to, say, its last third. We then 
obtain a "truncated" improvement order. This truncated improvement order leads to a poly- 
hedron of possible goodness vectors, and thence to a center of gravity, in exactly the same way 
that the full improvement order does. However, such a polyhedron will in general be somewhat 


larger than that obtained with a full improvement order, so that the resulting relative goodness 
vectors will be less "reliable." 





*See the discussion on page 9. 
TUnder inclusion. 
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CONCLUDING REMARKS 

1. Everything that was said in section 7 of [2] about the advantages and limitations of 
the type of method described there applies with equal force to this application. In particular, 
once it is established that the linear programming model is applicable, any set of questions 
may be used that determine the values sufficiently closely. Thus we may with a clear con- 
science pick out those questions that are most efficient from the computational viewpoint. 

2. There are two kinds of consistency problems associated with the work described 
here. One is the self-consistency problem. Will the token decisions made by any one Board 
yield consistent numerical answers? As stated above and in section 7 of [2], the applicability 
of the mathematical model may stand or fall on this question. The available experimental data 
seems to point to self-consistency; the only inconsistencies encountered were of a very minor 
nature (see page 9). We concede, though, that this question has not yet received sufficient 
attention and should be investigated on a more systematic basis. 

The second consistency problem is that of consistency as between the numerical 
implications of decisions made by different naval officers or different Boards. We have stated 
before, and we state now that this is not a requirement that our system need satisfy. An 
allocation plan and the token decisions leading to it actually constitute military command 
decisions. Military decisions are always made by the officers appointed for the purpose of 
making them; they are not less valid because other officers, in the same position, might have 
made somewhat different decisions. The purpose of the method described here is not to replace 
the system of military command by a mathematical system; it is to transform a command 
decision which because of its enormous complexity must needs be made on a largely haphazard 
basis into a set of less complicated command decisions, each of which can be decided intel- 
ligently on the basis of the military judgment and experience of the deciding officers. 

In point of fact, Naval Personnel with widely differing backgrounds often exhibit 
surprising unanimity regarding their decisions. On most of the priority and top-anchor 
questions that were asked of several officers, there was complete agreement. Some borderline 
cases might occasionally be called a toss-up by one officer while they would be definitely 
decided by another one; much more rarely, one officer would decide a problem one way while 
another one would decide it the other way. The priority ratings and top anchors determined 
according to the decisions of different Naval Personnel generally differed by no more than 
about .1 or .15. This unanimity should actually not be too surprising. The officers draw on 
their naval experience to answer questions that they consider trivial, but that go a long way 
toward determining the values we are after. It seems that the naval knowledge on which most 
officers can agree is sufficient to determine the values quite closely. 

3. We might mention that certain of the special assumptions we have made should be 
checked before each time that the technique is applied to a new string of requirements. As an 
example, the assumption that the MIP items can be bunched into a small number of priority 
classes whose members have approximately equal priority ratings should be checked. This can 
easily be done by asking a complete set of priority questions (i.e., a set of priority questions 
sufficient to determine the priority rating) for each of a number of activities within a certain 
priority class. Although the Board itself decided on the grouping involved, it might turn out in 
some cases that such a grouping is artificial, and that the priority ratings are more adequately 
represented, for instance, if a few of the ratings are determined individually and some kind of 
curve fitting or piecewise linear method is used for the others. 
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NETWORK ANALYSIS OF PRODUCTION SMOOTHING* 


Te Chiang Hu and W. Prager! 
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The purpose of this paper is to show, on an example, how rather 
forbidding algebra may often be avoided by quite elementary considera- 
tions concerning flows in a properly constructed flow chart or network. 
The problem of production smoothing is incidental in the sense that it 
could have been replaced by other illustrations of the technique. 











INTRODUCTION 

Hoffman and Jacobs [1] and Antosiewicz and Hoffman [2] discussed the following prob- 
lem of production smoothing: a homogeneous product is to be manufactured over a given 
number of equal periods of time; the sales requirements that must be met at the ends of these 
periods are prescribed, and the constant unit costs of production, storage, and change in 
production are known; a production program of minimum total cost is to be determined. The 
first paper contains an explicit solution for monotonically increasing sales requirements; in 
the second paper this is extended to requirements that monotonically increase up to a certain 
time and monotonically decrease thereafter. Johnson and Dantzig [3] generalized the problem 
by considering the costs as known functions of time; they gave a direct method of solution for 
monotonically increasing sales requirements and an iterative method for the general case. 
Bellman [4] showed that the problem could be treated by the methods of dynamic programming. 

The understanding of the methods prescribed in these papers requires greater 
mathematical knowledge than many a potential user may possess. For this reason, an alter- 
native approach is developed in the present paper. A production program is represented by a 
set of flows in a network. A feasible starting program without inventory is gradually improved 
by the superposition of self-contained flow patterns that do not involve any outputs. Rules for 
the efficient use of these flow patterns can be derived with a minimum of mathematics. 


NETWORK AND STARTING PROGRAM 

The suggested network is a chain of identical units. The full lines in Figure 1 show the 
branches of the unit corresponding to the k-th period; the dotted lines indicate how this unit is 
joined to its neighbors. Each branch has a positive sense marked by an arrow; no branch is 
allowed to carry a flow in the negative sense. 





*Manuscript received October 15, 1958. 
The results reported in this paper were obtained in the course of research sponsored by the 
International Business Machines Corporation of New York City. 
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The flow in branch AB represents the labor 
employed during the k-th period; to obtain it, the labor 
employed during the preceding period (flow reaching A 
from the left) must be either augmented by the labor 
hired at the beginning of the k-th period (flow reaching 
A from above) or diminished by the labor fired at this 
time (flow leaving A towards the upper right). The 
small circle at C indicates a servo device that measures 
the flow in AB and pumps an equal flow out of an infinite 
reservoir into the branch CE. The flow in CE repre- 
sents the amount produced during the k-th period, the 
unit being the amount produced by the unit of labor 
during one period. Finally, the flow in DE, stands for 
the inventory during the k-th period. When this is 
augmented by the amount produced in this period (flow 
reaching E from the upper left) and diminished by the amount sold at the end of the period 
(flow leaving E downwards), the inventory of the next period is obtained (flow leaving E 
towards the right). 

In the following, the drawing of networks will be simplified by omitting the symbols for 
the servo devices and letting the origin of each production branch coincide with the origin of 
the corresponding labor branch. Moreover, only those hiring, firing, and sales branches will 
be drawn that carry nonvanishing flows, and all arrows will be omitted. 

For given sales requirements, there exists a unique program with zero inventory for 
all periods. At the beginning of each period, this program calls for just enough hiring or firing 
to provide the labor force needed for the production of the amount that is to be sold at the end 
of the period. Figure 2a shows such a program with zero inventory; Figure 2b gives an alter- 
native program for the same sales requirements. As will be shown later, each of the two 
programs is optimal under certain assumptions regarding the costs of hiring, firing, and 
inventory carrying. 
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The programs in Figure 2 suggest two important remarks. First, the sum of the 
entries in the labor branches must equal the sum of the prescribed sales requirements and 
therefore have the same value for the two programs. Accordingly, the constant wages add up 
to the same total for each program. This means that wages are not a relevant factor in the 
determination of an optimal program. Second, for each program the total labor hired equals 
the total labor fired. This shows that the unit costs of hiring and firing do not enter the dis- 
cussion individually but only by their sum. It follows that the number of relevant cost 
parameters can be reduced to one, namely the cost of carrying a unit inventory through one 
period, by choosing the combined cost of hiring and firing one unit of labor as the unit of cost. 


SELF-CONTAINED FLOW PATTERNS 

If the program with zero inventory is 
not optimal, it can be improved by the super- 
position of self-contained flow patterns that do 
not have any flows in the salesbranches. Fig- 
ures 3a and b show basic patterns of this type, 
while Figures 3c and d show patterns obtained 
by the combination of these basic patterns. -3 -6 -6 -6 -3 

The flow pattern of Figure 3a, for Pd 3 - 

-3 


-3 
instance, reduces the amount of firing at the 3 3 
beginning of its first period by two units, thus (c) 3 


A 
Ne 
Oo 3 
increasing production in this period by two 


units and inventory during the next period by 
the same amount. Since a self-contained flow 
pattern must not have any flows in the sales 
branches, this additional inventory must either 
be carried through further periods or, for a 
pattern of minimum length, be cancelled by a 
corresponding drop ofproduction inthe second 
period. Accordingly, a self-contained flow 
pattern extends atleast overtwoperiods. Pat- 
terns of this minimum length will be termed basic; patterns of greater length can be regarded 
as resulting from the combination of basic patterns and will therefore be called combined 
patterns, 











Figure 3 


When the pattern of Figure 3a can be superimposed on a program without leading to 
negative flows in any branch, it reduces the total hiring and firing by four units each and 
increases the inventory by two units. The saving achieved by this superposition thus equals 
2(2-a). Similarly, the superposition of the pattern in Figure 3b results in a "saving" of 
-(1+), i.e., this superposition does not improve the program. 

Figure 3c is obtained by multiplying all flows in Figure 3a by 1.5 and repeating the 
resulting pattern to cover four consecutive periods. The pattern in Figure 3c can be super- 
imposed on the program of Figure 2a, beginning with the second period. The saving achieved 
by this superposition equals 6(2- a); the resulting production program coincides with that of 
Figure 2b for the first five periods. 

Whereas Figure 3c is obtained by placing two patterns in juxtaposition, Figure 3d 
results from combining the patterns in Figures 3a and b so that the last period of Figure 3a 
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overlaps the first of Figure 3b. If the flow 
pattern in Figure 3d can be superimposed 
on a program without leading to negative 
flows in any branch, a saving of 3(1- a) can 
be achieved. 

Since the self-contained flow patterns 
will be used to improve an initial program 
without inventory, only such patterns need to 
be considered that involve positive invento- 
ries. Figure 4 shows all basic patterns that 
involve a unit inventory in one period. For 
easy reference, these patterns are given the 
labels A, B,; Bo, C;, Co, D,, Do, and E, 
and the "saving" achieved by each is noted 
following the label. Only patterns A, By, 
and Bo can achieve actual savings. By 
themselves, the other patterns would 
increase cost, but depending on the value 
of a they may be combined with A, Bi, and 
Bo to produce some saving. 

In the following a combination of pat- 
terns will be specified by listing the sequence 
of its components, a comma between suc- 
cessive components indicating juxtaposition, 
and a semicolon overlap by one period. Thus, 
the pattern in Figure 3c is specified by the 
symbol 1.5A, 1.5A and that of Figure 3d by 
the symbol A;D,. 

An optimal program cannot involve 
both hiring and firing at the beginning of the 
same period. Since the initial program without inventory is subject to the same restriction, 
only such combinations of patterns need to be considered that do not require simultaneous 
hiring and firing. The juxtaposition of patterns A and E, for instance, or the overlapping of 
patterns A and B, by one period are ruled out by this condition, as is the overlapping of 
different basic patterns by two periods. Also, except for C; and Co, no basic pattern can be 
combined with itself with an overlap of one period. 

The directed graph, Figure 5 indicates all possibilities of combining basic patterns with 
an overlap of one period. The vertices of the graph stand for the various basic patterns and 
the directed arcs for the possible combinations, the arrows indicating the order in which the 
pattern at the vertices of the considered arc must be combined. Each vertex bears the label 
of the corresponding pattern followed by the "saving" achieved by its use. The labels C, and 
Co are circled to indicate that each of these patterns can be combined with itself. Any path in 
the graph represents a possible sequence with overlaps of one period between successive pat- 
terns. Each of the patterns in such a sequence may be used with an arbitrary positive factor 
provided that the resulting combination does not produce negative flows when it is super- 
imposed on the program that is to be improved. Except for C, and Co, each vertex of the 


Figure 4 
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A:2-a Cee E:-2-a 











Or @-s 


Figure 5 


path stands for a single use of the corresponding pattern; the vertices C, and Co, however, 
may represent single or repeated use of these patterns. 

It is readily verified that the saving achieved by a combination of patterns is the sum 
of the savings of the individual patterns used in the combination. This statement applies to 
patterns in juxtaposition as well as overlapping patterns. 

The savings listed in Figures 4 and 5 show that the following cases must be considered: 


Case I: 
Case II: 
Case II: 


Case I: 


@ > 2; 
>a>l; 
>a>d. 


a@ > 2. In this case, none of the basic patterns of Figure 4 achieves a genuine 





saving. On account of the additivity of the savings of combined patterns, this 
means that there exists no combined pattern involving only nonnegative 
inventories that achieves a true saving. Accordingly, the initial program 
without inventories is the optimal program. 


2 >a > 1. In this case, A is the only basic pattern that achieves a genuine 





saving. Accordingly, the other patterns should not be used by themselves 
but only in combination with A. Consider, for instance, the combination 3A; 
D)> which extends over three periods and saves 5-4a. By itself, the first 
component 3A would save more. The only possible justification for the use 
of the second component D, could be that the use of 3A alone would produce 
either negative hiring at the beginning or negative firing at the end of the 
second period. 


Assume first that D, is introduced to avoid negative hiring at the beginning of the 
second period. It is then convenient to treat 3A; D, as resulting from the superposition of 
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2.5A; 0 and 0.5A; D,, where the zero in the first pattern is added to indicate without ambi- 
guity the relative position of the two patterns. The pattern 2.5A; 0 reduces to zero the amount 
of hiring that the initial program prescribed for the beginning of the second period, and the 
pattern 0.5A; D, avoids the occurrence of negative hiring at this time as more of pattern A 
is used to bring about further changes in the program. Since, however, the "saving" achieved 
by the latter pattern is negative, viz. -3a/2, it is preferable not to use this pattern at all. 
In other words, the use of 2.5A; 0 is preferable to that of 3A; D;. (Note that the validity of 
this statement does not depend on the value of the positive parameter @.) 

Assume next that D, is used to avoid negative firing at the end of the second period. 
The pattern 3A; D, must then be considered as resulting from the superposition of A; 0 and 
2A; D;, the latter involving no firing at the end of the second period. For the considered range 
of a (2 >a@ >1), the "saving" achieved by the use of the latter pattern, viz. 3(1-a@), is nega- 
tive; this pattern should therefore not be used at all, and A; 0 is preferable to 3A; D;. 

In a similar manner combinations in which Do precedes A and combinations of A and 
E can be ruled out. It follows that the initial program without inventory can be transformed 
into the optimal program by the sole use of pattern A. The only way in which this pattern can 
be combined with itself is by juxtaposition. A sequence of patterns A in juxtaposition involves 
alternating negative firing and hiring; it begins and ends with negative firing. This means that 
any break in a sequence of alternating firing and hiring in the initial program marks an end of 
a sequence of combined patterns A used in the transition to the optimum program. These 
breaks divide the initial program into parts that can be treated separately. 

The initial program of Figure 2a has an uninterrupted sequence of firing and hiring, 


beginning with firing at the end of the first period and ending with firing at the end of the last 
period. To improve this sequence, begin at the left and progress towards the right, super- 
imposing, at each stage, the highest positive multiple of A that will not cause negative hiring 
or firing. In the first stage, for instance, 3A reduces the hiring at the start of the second 
period to zero and leaves three units of firing at the end of the second period. Thus, 3A must 
be used in the second stage. Proceeding in this manner, one obtains the optimum program of 
Figure 2b. 


Case Il: 1>a»>0. As a drops below unity, pattern A ceases to be the only one 
achieving a genuine saving. Patterns B, and Bo produce savings for a > 1 
and do not cause any loss for a = 1. Patterns A, B,, and Bo can therefore 
be used by themselves, but the other patterns should only be used in com- 
bination with one or more of these advantageous patterns. The number of 
combinations that must be discussed increases as @ decreases. 





To indicate typical arguments used in this discussion, assume that 1 > a > 2/3 and 
consider again the pattern 3A; D,- As has been shown previously, it does not pay to use D; to 
avoid negative hiring at the beginning of the second period. If, on the other hand, D, is used 
to avoid negative firing at the end of the second period, the pattern 3A; D, must be treated as 
resulting from the superposition of A; 0 and 2A; D,- Since now a > 1, the second pattern 
achieves a genuine saving, viz. 3(1- a@). The second pattern is however found to be identical 
with 2B,; Bo, and only one of these identical patterns needs to be retained in the list of 
advantageous patterns. 
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The complete discussion of the possible combinations of basic patterns reveals that 
only the following patterns need to be considered: A (saving 2-qa), By; Bo (saving 2- 2a), 
and By or Bo (saving 1-a@). Note that it is not possible to form new patterns by combining 
any of these patterns with an overlap of one period. When the patterns are strung together in 
juxtaposition, there will never be a sequence of two firings or four hirings. Accordingly, the 
occurrence of a sequence of this kind in the initial program marks a break, which divides the 
program into parts that can be treated separately. Another type of break occurs when the 
initial program has two consecutive "joints" at which neither hiring nor firing takes place. 

The initial program without inventory shown in Figure 6a does not exhibit any of these 
breaks and must therefore be treated as a whole. An immediate improvement is obtained by 
the superposition of the sequence 2A, Bi; Bo; Bi; Bo beginning with the second period in 
Figure 6a. It is then seen that a further improvement results from the superposition of 2Bo 
beginning with the fifth period. Finally, the pattern 2B); Bo can be superimposed on the last 
three periods. The resulting optimal program is shown in Figure 6b. It is worth noting that 
the value of a is relevant to this discussion only to the extent to which it restricts the patterns 
that may be used for the improvement of the initial program without inventory. Thus the 
program of Figure 6b is optimal for any @ in the considered range. 
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FURTHER NOTES ON THE LOSS RESULTING FROM THE USE OF 
INCORRECT DATA IN COMPUTING AN OPTIMAL INVENTORY POLICY* 


Joel Levy! 


Bureau of Supplies and Accounts 





In the Naval Research Logistics Quarterly of March 1958, the author 
published some exploratory research on ''Loss Resulting from the Use 
of Incorrect Data in Computing an Optimal Inventory Policy.'' The 
present paper continues these investigations by examining the effect of 
errors in the loss function (Part I) and in the discount rate (Part II). 











INCORRECT PENALTY FUNCTION 

In the following an algebraic expression will be derived for the increase in the cost of 
an inventory policy that results from a given error in the loss function. The ordering costs 
and penalty for depletion are assumed directly proportional to order quantity and shortage 
quantity respectively. 


1. Description of the model and the equation for the cost of the optimal inventory policy 


When order costs and penalty for depletion are both linear, the cost of following an 
optimal inventory policy is: 





ee re) 
f(x) = Min [kiy-m os { f p(s - y) g(s) ds + f(0) J g(s) ds 


y2x 


y 7 
‘ J tly - 8) g(s) as | 


In reference (a) the optimal policy is shown to be: 


for x<xX order X - x 


for x >X do not order, where x is the value of y satisfying the equation: 





—é *Manuscript received 29 May 1958. 
<=. 


tThe author wishes to thank his colleagues of the Research and Development Division and is 
(", §rateful in particular for the encouraging supervision of Mr. J. R. Simpson, Jr. 
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We shall also need the following facts from reference (a): 


© % 
kx +ap f (s - x)g(s) ds - ak J (x - s) g(s) ds 
x oO 





= l-a 


for x <x f(x) = f(o) - kx. 


2. Cost of the optimal inventory policy with incorrect penalty function 
If the penalty per unit shortage is actually p, but is incorrectly assumed to be p, then 
the cost of following the optimal inventory policy for x = 0 will be: 





(3) x' , 
kx'+ap [ (s - x') g(s) ds - ak / (x' - s) g(s) ds 
= oO 


(1 - a) 





fp" (0) = 


where x' is the y satisfying 


f id de = 22-* 
.* ~ a@-k) 


The proof of this is similar to that found in reference (a) p. 161 giving the expression 
for {(0) which we transcribed in paragraph 1. The difference is that now, since our policy is 
derived assuming p is the unit penalty for shortage,x' is the stockage objective instead of x. 
Carrying through the argument in full we get: 


ra © 
ip =k 2) o0fp [ (s - x') g(s) ds + £5, (0) [ g(s) ds 


“x 


wr 


x 
+ J {5 (&' - s) (s) esl. 


In getting the expected cost of this policy, we use the true penalty for shortage and not 
the one incorrectly taken as the basis for computing the policy. Also 


f(x) = -k or f5 (x) =f (0) - kx. 


Substituting the last expression of f p*) in the right side of the above equation we get: 
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x' 


[> ¢) 
ipedee aaa | p(s - x’) g(s) ds + £5, (0) - J XG - 9) (6) a. 


Andfor x =o this yields: 


re) x' 
kx'+ap [ (s - x') g(s) ds - ak / (x' - s) g(s) ds 
x' oO 





fz; (0) = ee 


For 0<x<*x', {5 (x) = fs (0) - kx. 


3. Loss due to incorrect data on penalty function 
For x not greater than the smaller X or X' we have: 





f5 (x) - f(x) = fs (0) - f (0) 


1 x 
“ k(x' - x) +ap [ sg(s)ds 


@ x 
- ap (x' - x) [ g(s) ds+anx J g(s) ds 


= — ww 


x x x 
- ak (x' - d kx d k 
ak (x' - x) / g(s)ds+akx i g(s)ds+a i 


Oo 


sg (s) es| 


1 al 
phftor 9-20 j sg (s)ds 


a <0 x 
raxp-m) [ g(s) ds - ap (x' - x) i g(s)ds-ak@-» J c(9 4a} 


1 wd 
-y{a-oxe 9-20-29 [ sg(s) ds 


x' 2) 
+ ax (p - k) [ g(s) ds - a (p - k) (x' - x) [ cl as}. 
x x 
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4. Two equations used to simplify the expression in paragraph 3 
We now point out two results that we shall need in simplifying the last expression for 
the difference {5 (0) - f(0). 








(a)_ap-k _ ap-k _ (ap -k) (@-k) - (ap- k) @-k)_ (i-a)k @-p). 
a@-k) a(p-k) a(p - k) (p - k) a(p - k) (p - k) 


(b) By the first theorem of the mean for integrals: 


= x 
[ sg(s)ds=u [ g(s) ds where u is between X and x’. 
x x 


(See for instance, Graves, Lawrence M. The Theory of Functions of Real Variables, 
2nd Edition (McGraw-Hill Book Company, Inc., 1956) p. 96.) 





5. Simplified expression for loss due to incorrect data on penalty function 
We can therefore rewrite the final formula in paragraph 3 as follows: 





£5 (0) - £(0) = {0 - a) k(x' - ®) 
1 


x' 6) 
- a(p - k) (u - x) [ g(s) ds - a (x' - xX) (p - k) [. el) a. 
x 


Substituting for the expressions under the integrals we get the above equal to: 





1 {a- anor -9-2@- NEDO -9eG-D 
l-a a(p - k) (p - k) 





_ a(x' - x) (p - k) (1 - a) k 
a(p - k) 





~n{@r-n)-2-2 0-0. 6-2-0 
6 -¥ @ -*) 


Sanam (@-9[6-W-@-»]-«-26-n| 
p-k 





-k@-p)@'-u)_ 
p-k 


This last expression is positive provided p > k. But in view of the economic meaning 
of the symbols we are only interested in cases where the possible penalty for shortage exceeds 
the unit price of the commodity. If p > p then X'> u> X andif p> Dp then xX >u> x’. 





LOSS RESULTING FROM THE USE OF INCORRECT DATA 


INCORRECT DISCOUNT RATE 

In the following an algebraic expressed will be derived for the increase in the cost of 
an inventory policy that results from a given error in the discount rate. The ordering costs 
and the penalty for depletion are assumed directly proportional to order quantity and shortage 
quantity respectively. 


1. Description of the model and the equation for the cost of the optimal inventory policy 
The model we use and the equation for the cost of the optimal inventory policy is the 
same as that in paragraph 1 of the preceding section. 





2. Cost of the optimal inventory policy with incorrect discount rate 
If the appropriate discount rate is a, but is incorrectly assumed to be 4a, then the cost 
of following the optimal inventory policy for x =o will be: 





x' 


@ 
kx' + ap . (s - x') g(s) ds - ak | (x' - s) g(s) ds 
x o 





ad l-a 


where x' is the value of y satisfying 


y = 
J et0 as 6=Hy° 
ry) 
For o<x<*x', fz (x) = £5 (0) - kx. 


The proof is practically identical with that given in paragraph 2 of the first section and, 
therefore, need not be repeated here. 


3. Loss due to incorrect discount rate 


Using the argument of Part I, paragraphs 3 to 5, we find that for x not greater than the 
smaller X or x': 





f(x) - £(x) a {a - a) k(@'-%) 
1 


x' i) 
- a(p - k) (u- x) [ g(e) da-aG-RO-K | sc as, 
x x 


where u is given by the equation: 


x x 
u L g(syas= [ sg(s)ds. 
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4. Equations to simplify the expression in paragraph 3 





(a) ap-k _ ap-k _a(ap-k)-a(ap-k)_ (a-a)k 
a(p-k) a(p-k) a a (p - k) aa (p - k) 





(b) ap-k _(1-a@k 
“Gip-k) a(p-k) 


ie, 1 
Toi ™ ** TGF 





Setting a= we can simplify (a) to: 


1 1 
lt wai _(1+i-1-i)*_(@-i)k 


Fesd Fess kame ® -k) (p - k) 


(a') 





And we can simplify (b) to: 


" ow ik 
-_(p-k) p-k 


1+i 





5. Simplified expression for loss due to incorrect discount rate 
Substituting (a') and (b’) in the appropriate parts of the formula of paragraph 3 we get: 





«@ 


f= (x) - £(x) = 1 {a-ak - 
1 





SMe D ODE ae aE Wik) 
(p - k) (p - k) 


=) {(1-a)k@-B-a(u-¥ (i-T)k- a - Dik} 


l-a 


= kG -@- {((u-% (i-1) 47-3) 


-k@-»-—2* (u-Hi+T@ -w)}. 


l-a 


Substituting a = i : we get the above equation equal to: 


+i 
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k= 9) -= {(u- %)i+T @'- w)} 


=k @'- 3) -k(u-D- kA -w=k GR -w) ja -2]. 
i 


It is comforting to note that the last expression is positive when a 4a. If a> a, then x' >u >x 
and i> i; so both factors are positive. If @ <a, then X > u > X' and i > i; so both factors in 
the final expression are negative. 
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A CONVEX PROGRAMMING PROCEDURE* 


D. A. D'Esopo 


Stanford Research Institute 





The procedure whichinvolves successively minimizinga convex function 
with respect to each ofits coordinates is showntoconverge if constraints 
are rectangular and the function has continuous derivatives. It is also 
shown that certain more general procedures, which might be expected 
to converge more rapidly, are convergent. A criterion is exhibited 
which yields information concerning the rate of convergence of this 
method. 











The procedure described below has been studied in connection with convex quadratic 
functions by C. Hildreth [1]. We shall here consider the application of this method to mini- 
mizing convex functions in general. In addition, the difficulty noted in [2] is remedied. 

The method is briefly the following: Let a real valued function ¢ be defined for a set S 
of points in N-dimensional (real) space. Let Zo be an arbitrary point of S. Let Zy be any 
point of S which differs from zp at most in the value of its first coordinate, but which mini- 
mizes @ among all such points. Let Zo be any point of S which differs from Z, at most in the 
value of its second coordinate but which minimizes ¢ among all such points. Continue this 
process, defining Za, Z4,+++s Zp by holding all coordinates except one fixed at previously 
determined values and varying the 3rd, 4th, ..., Nth coordinate so as to minimize ¢. Continue 
the process, defining 2N41 by minimizing again with respect to the first coordinate, and so on 
ad infinitum. ! 

More precisely, the sequence {z,} is obtained by selecting Zo arbitrarily in S. Then, 
having chosen zy? let Ch be the set of all points of S which differ from Zn at most in the value 
of their (n (mod N) + 1)th coordinate, and select Zn from Ch in such a way that 


o(z_.,) = min 9(z). 
n+l z°C, 


Intuitively, it would seem that the sequence of points so obtained might tend to some 
point of S at which ¢ has its minimum value, if such points do indeed exist. Or failing actual 


convergence, it would at least seem hopeful that the sequence {¢ (z,,)} might tend to ¢ in? the 





*Manuscript received June 20, 1958. 

lIt must be assumed that such selections are possible, that is, that such points which minimize 
$ do indeed exist. However, these points do not need to be distinct. Indeed, they may be all 
identical from some point on. 
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infimum of $(z) for z in S. Conditions under which these conjectures hold are given in a 
theorem stated below. Our attack begins with the study of a certain class of transformations 

of S into itself. This class will ultimately be shown to include the coordinatewise minimization 
transformations which are essentially involved in the foregoing. 


Condition A(P, ¢, S) 
The triple of entities (P, ¢, S) will be said to satisfy the condition A(P, ¢, S) if 





A.1. S is a subset of some finite dimensional real space, P is a continuous transfor- 
mation of S into S,¢ is a real valued function defined and continuous for all points of S, 


A.2. @ (P(z)) < ¢(z), 
A.3. If ¢ (P(z) ) = ¢(z) then z= P(z). 


We first show the following "fixed-point"' lemma. 
LEMMA 1: If A(P, ¢, S) and if the sequence {z,} satisfies 
o(z,,4) < o(P(z,)), 


with Zz, any point of S, then at any point z of S which is an accumulation point of {z,} 
we have P(z) = Z. 


(By an accumulation point of a sequence {z,} is meant a point with the property that, to 
every sphere (or segment) of which it is the center and to each positive integer k there cor- 
responds at least one positive integer m > k such that the mth term of the sequence {z,} lies 
in the sphere. Throughout this paper much use will be made of the fact that if f is any function 
continuous over a subset E of N-dimensional real space having values in a subset F of M- 
dimensional real space, and if Z is any point of E which is an accumulation point of {z,}, a 
sequence of points of E, then f(z) is an accumulation point of {f(z,)}. To prove this, suppose 
to the contrary that f(z) is the center of some sphere V which contains no terms of the 
sequence {f(z,)} beyond the mth. The continuity of f guarantees the existence of a sphere U 
centered at Z such that whenever x is both in U and in E, then f(x) is in the sphere V. It 
follows that no term of {z,} beyond the mth lies in U, and hence that z is not an accumulation 
point of {z,}, a contradiction.) 


PROOF OF LEMMA 1: The conditions 1.1 and A.2 together imply 


(1.2) (Zz, ,4) < o(P(z,)) < o(z,). 


From this we learn that the sequences {$ (z,,)} and {¢ (P (z,) )} are both monotic decreasing ° 
and both have the same greatest lower bound. Since ¢ is continuous, ¢(Z) is an accumulation 
point of {¢(z,)}. Since P is continuous P(Z) is an accumulation point of {P(z,)} and hence 
@(P(z)) is an accumulation point of {(P(z,) )}. But, if a monotone decreasing sequence has 
an accumulation point, this point must also be the greatest lower bound of the sequence. It 
follows that ¢(P(z)) = (Zz), and, because of A.3, P(z) = Zz. 
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If P, and Py are transformations of § into itself, the composite transformation 
(Py Py) is defined by the equation 


(PP) (z) = Po (P, (z)) . 


The composite of N transformations P,, Po, eaters Py which we shall denote (Py Py-1 arr P,); 
is similarly defined by successive applications of the transformations P,, Po, ween Py-1 and 
Py: We next prove a lemma about composite transformations. 


LEMMA 2: If both A(P,,¢, 8) and A(P5,4,S), andif P =(P,P,), then A(P, ¢, S). 
Moreover, if, in addition, P(z) = z then P, (2) z= P, (z). 


PROOF: By a well-known theorem, P is a continuous transformation of S into S, thus 
establishing A.1. But 


(2.1) ¢(P(z)) = o(Py(P,(z)) < o(P,(z)) < (2), 


which establishes A.2. 

Next, assume ¢(P(z)) = $(z). Then we may use 2.1 to see that $(z) =¢ (P, (z)), so 
that by A.3 for P, we have z = P, (z). Hence, (again using the assumption ¢(P(z)) = ¢(z) ) we 
have $(z) = (Po (P, (z))) = ¢(Py(z)), from which by A.3 for Py we have z = Po (z). That is, 
the assumption ¢(P(z)) = ¢(z) implies Pi (z)=z= P, (2) which implies z = P(z) so that A.3 
has been established. The above argument, being based on the assumption ¢(P(z)) = ¢(z), 
applies a fortiori when P(z) = z to prove P, (z)=z= Py (z). 

Lemma 2 may be generalized by induction to the composition of N, transformations 
Py, Po, pees Py each satisfying condition A. 


LEMMA 3: If P,, Po, eieKeis Py are transformations such that A(P,, ¢, S) holds for 
each i,i=1,...,N, andif P= (Py Pyiy-:: P,) then A(P,¢, 8S). Moreover, if, in 
addition, P(z) = z, then P; (z) =z for each i, i=1,..., .N. 


THE CONTINUITY OF COORDINATEWISE MINIMIZATION 

We now return to specific consideration of N dimensional space Ew and prove the 
continuity of the coordinatewise minimization used in the Hildreth procedure. To state the 
following lemmas, it is convenient to introduce some notation. 

If z is a point of Ey we will denote the value of its jth coordinate by [z], j 

A subset S of Ey will be called rectangular if there exist N-tuples, a andf, of (pos- 
sibly infinite) real numbers such that 


[2], <[2]) <(]; i=1--..N 


is satisfied for each point z of S, andif S includes all points satisfying the inequalities. 
The set C(z,S,i) consists of all points x of S for which the N-1 equations 


[2]; = [x], S@ by case MOS 8 
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are satisfied. (Thus z « C (z,S,i).) This set consists of those points of S which differ from z 
at most in the value of their ith coordinates. 

A function ¢ will be called convex’ over § in its ith coordinate if it is a continuous 
convex (but not necessarily strictly convex) real valued function, defined over S, and if for 
each point z of S, and for each i, i=1,...,N,% has a unique minimizing argument in the 
set C(z,S,i). 

Finally, we let T(¢,S,i) be the transformation of S into itself which associates to each 
ze S, the unique ¢« C(z,S,i) which minimizes ¢ among all points of C(z,S,i) (assuming that 
such ¢ exists). 





We first prove a simple property of rectangles. 


LEMMA 4: If S is a rectangle, x and y are points of S such that ye C(x,S,i), and if 
{x, } is a sequence of points of S which tends to x, then the sequence {y, } defined by 
(Yn) j = [x]; for j + i and [Yai = ly]; converges to y. 


PROOF: It is clear that 
[“]; S [Yn]; s 6], 
for j +i since for such j, MANE = [x,];- But the same inequality holds if j = i since 


[ynli = [y]; and yr S. Hence {y,,} is a sequence of points of S. Moreover, it is clear 
that Yn ° C(x,,S, i). The fact that {y,} tends to y is a simple consequence of the fact that the 


distance from y, to y is less or equal to the distance from x, to x, and this in turn is due to 
the fact that [y,, - y); = [x, - x]; for j +i and [y, - y]; = 0. 


Next, we show the desired continuity of the minimization transformations under a 
fairly general set of hypotheses. 


LEMMA 5: Let S be a rectangle, ¢ be convex' over § in its coordinate, and 
P = T(¢,S,i). If, in addition, either S is bounded, or if for each finite real number r - 


the set of points x at which (x) < r is bounded, then P is a continuous transformation 
of S into itself. 


PROOF: Let z be any point of S. The hypotheses on ¢ make P well-defined for z. 
To prove its continuity, we let {z,} be any sequence of points of S tending to z and let 
P(z,) =x, and let P(z) = y. Suppose that the sequence {x,} has a point of accumulation x + y. 
By extracting subsequences first from the sequence {x,} and then the corresponding subsequence 
from {z,,}, we may obtain sequences {x',,} and {z',} tending to x and z respectively and such 
that ¥. = P(z',)- There is, in consequence, no loss in generality if {x,} is assumed to tend to 
x (instead of merely having x as an accumulation point). We first observe that x« C(z,S, i), 
for, when j + i, [x,]j = [P (z,) F = [2,1]; , and the latter tends in n to [z], . In other words, 
[x], = [2], , for j + i so that indeed xe C(z,S,i). We now see that the hypotheses of lemma 5 
are satisfied, that is {x,} is a sequence of points tending to x, and ye C(x,S,i) (this last since 
both are in C(z,S,i).). Hence we choose a sequence {y,} so that lim y, = y and 
y, © C(x,,8, i). li 
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Observe that we have now three sequences: {z,}, {x,}, and {y,}- These tend to z, x, 
and y respectively. Note also that Zy> Xp and y, are all in the same "C" set, namely 
C(z,,8, i), and that z, x, and y are in the same "C" set, namely C(z,S,i). But P(z,) =X) 
so ¢(x,) < ¢ (y,,)- Hence, since ¢ is continuous, ¢(x) <¢(y). But also P(z) =y so that 
¢(y) <¢(x). This makes ¢(x) =¢(y). The assumption that x + y has implied that ¢ has a 
nonunique minimum in the set C(z,S,i), and this is a contradiction. Thus, the sequence {x,} 
may have at most one point of accumulation, namely y. That it has a point of accumulation is 
a consequence either of the closedness and boundedness of §S, or of the set of points x such 
that $ (x) < me ¢(z,) which is a finite real number because ¢ is continuous. 


To summarize: If z is any point of S, and {z,} is any sequence of points of S tending 
to z, then the sequence {P(z,)} = {x,} tends to the point P(z) = y. This condition is equivalent 
to the continuity of P. 


CONVEX FUNCTIONS WITH CONTINUOUS PARTIAL DERIVATIVES 

We next show some properties of convex functions with continuous partial derivatives. 
These are proved by Allen [3] and are special cases of results appearing elsewhere, for 
example, Frank and Wolfe [4]. 


LEMMA 6: If ¢ is a convex function over the convex set S in E 
first partial derivatives, and if 


nao= i [c- 2z],, 


N having continuous 


then for each z, (cS, 
@(¢) 2 (z) + h(z, 2). 
PROOF: For any z, ¢«S and anya such that 0 <a<1l, 


(1 - a) $(z) +ag(%) >o((L- a)z+at), 


(ct) - o(z) > {o(z + a(t - z)) - o(z)} ea. 


Since ¢ has continuous first partial derivatives the right-hand side of the above inequality may 
be written as 


{v0 + > aa - 2],-% + (a(t - 2) - va a 
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Star ~[e- 2y + fale 2)) +a, 


where the latter term tends to zero when a({- z) does. 
Thus allowing a to tend to zero we have 


(¢) > o(z) + h(z, 2). 


LEMMA 7: If ¢ is a convex function which has continuous first partial derivatives in 
the rectangular subset S of Ey and if P; = T(¢,S,i) for each i, i=1,..., N, andif 
Z is any point of S such that P; (2) =% for each i,i=1,..., N, then both 


26) 
L, 2 [2], [¢ - z], >0 


for all ¢ « S and Z minimizes ¢ over S, that is, ¢(%) >¢(Z) for each ¢ «© S. 


PROOF: Suppose for some /« § the term 


9¢@ 1, - 
9 [2]; 


= : 0 
were negative. This could occur only if uP were positive and [Z - Z]; negative or > 
z\; 
i 
negative and [¢ - Z] i positive. Suppose, for instance, the former is the case. The negativeness 
of [¢- 2]; implies that there exists in S a point whose ith coordinate is less than that of Z. 
But since S is rectangular, there is consequently a point in S (say ¢'), whose ith coordinate is 
less tan that of z, but which is equal to Z in every other coordinate. Thus, since or is 
d [z}. 
i 
positive, ¢(Z) decreases as Z moves toward ('. This implies that P; (z) + Z. The other case 
is quite similar. 
Accordingly, 


06 (z 
h(z,¢) = = sa [¢ - Zz], 20 


over S, and from the previous lemma we have 


o(¢) > @(z) forall ce S. 
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THE CONVERGENCE OF THE PROCEDURE 
The results of lemmas 1, 3, 6, and 7 are summarized in the following theorem. 


THEOREM: If the following conditions hold: 
1. S is a rectangular subset of Ey, 


2. ¢ is convex' over S in each coordinate, 


3. Either S is bounded, or for each finite real number r the set of points x at which 
¢ (x) < r is bounded, 


4. ¢ has continuous partial derivatives in S, 


Further, if the transformations P; of S into itself are defined by Pi = T@,S, i), if 

P= (Py Py_, --- Py), if {z,,} is any sequence of points of S such that {¢(z,)} is monotonic 
decreasing, and which has an infinite subsequence {z',} satisfying ¢ (z', +D <o (P(z',)) and, 
finally, if M is the set of accumulation points of {z,}, then the following conclusions hold: 


1. M is not empty, 
2. Every Z«M minimizes ¢ over the set S, 
3. M has only one point if ¢ is strictly convex, 


4. {o (z,)} monotonically decreases to ¢ 5, = _ ¢ (x), 


. If S is bounded, 


N 
26 (2) 
tS do a (zl, me oy 


is an upper bound for the difference (z,) -¢ miter and this upper bound tends to 
zero in n, 


. The above-mentioned bound is (when S is bounded) represented by 


N, 26 (z,) 
fu]; 


[u]; =* 





| en - a]; when 


where @ and determine the rectangle S as in the definition of rectangle. 
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PROOF: Since the sequence {¢ (z,)} is monotonically decreasing, the sequence {z, } 
lies in both the set S and the set Q of points x at which ¢(x) < ¢ (Zo), and this last is a finite 
real number. The closedness and boundedness of either S or Q given in hypothesis 3 implies 
M is not empty, that is, conclusion 1. Hypotheses 1 through 3 are those of lemma 5 (for each 
i, i=1,...,N) so it follows that Pi is a continuous transformation of S into itself for each 
i,i=1,...,N. Since the uniqueness of the minimum resulting from hypothesis 2 implies 
o(P; (z)) < @(z) except when P;(z) =z, for each i, i=1,..., N, we have condition A(P,,¢,S) 
satisfied for each i, i=1,...,N. It follows from lemma 3 that A(P,¢,S). The reasoning 
which shows M is nonempty establishes the existence of a point of accumulation z for the 
sequence {z',}- It follows from lemma 1 that P(Z) = Z, and hence from lemma 3 that P; (2) =Z 
for each i,i=1,...,N. The fact (hypothesis 4) that ¢ has continuous partial derivatives 
permits us to use lemma 7 to see that Z minimizes ¢ over the set S, that is, ¢(z) = —_~ ¢ (x). 

Xe 


But observe that, since ¢ is continuous, the value of ¢ at any point of M is an accumulation 
point of {¢(z,)}, and hence equal to its greatest lower bound. Thus, ¢ is constant over M. It 


follows, since Z« M, that the value of ¢ at any point of M is min ¢(x). We have thus proved 
xeS 


conclusion 2, and, incidentally, since a monotone decreasing sequence tends to its greatest 
lower bound, have also proved conclusion 4. 

Conclusion 3 is obvious. 

If 


t) 
h(z, t) = a am 7 Z|; > 


we have by lemma 6 


@(¢) > o(z,) + h(z,,¢). 


¢ min 2 ¢ (z,) + min h(z,, ¢) ‘ 
teS 


on {-h(z,, 6)} > (2) - onin » 


thus establishing the bound of conclusion 5. 
We must now show that this bound tends to zero. Suppose for any z we let 


H(z) = max {-h(z,¢)}. 
teS 


A theorem of linear programming allows us to rewrite this as 


H(z) = ape {-h(z,¢,)} , 
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where ¢, is the kth vertex of S. It follows that, since each of the functions -h(z, Cy) isa 
continuous function of z, H(z) is also a continuous function of z. Since for Z«M, by lemma 7, 
h(z, ¢) 20, we have -h(z¢,) < 0. But h(z, z) =0. Consequently H(z) = 0 when Z<M. 

Let r be any (possibly infinite) accumulation point of the sequence H(z, ) . Corresponding to 
the subsequence of {H(z,)} which tends to r there is a subsequence {z',} of the sequence {z,}. 
The closedness and boundedness of Q or S yield an accumulation point z of this subsequence, 
which must be in M since it is a point of accumulation of the original sequence as well. But as 
H is continuous, H(z) is a point of accumulation of the sequence {H(z',)}. The latter has at 
most one point of accumulation, namely r. Hence H(z) = r= 0. We have shown that 0 is the 
one and only accumulation point of the sequence {H (z,)}- Hence the sequence tends to zero. 
This completes the proof of conclusion 5. 

Conclusion 6 is obvious. 

Remarks: Sequences {z,} of the type described in the theorem above may be obtained 
in many ways, the simplest being the method indicated in the opening paragraphs. In general, 
such sequences are obtained by starting at an arbitrary initial point, performing one or more 
"cycles" of successive applications of the transformations P,, Po, ae Py then performing 
arbitrary » reducing transformations (for instance, using gradient methods), then more cycles 
of the P's, then more optional transformations, etc. The theorem therefore tells us that if 
various "convergence accelerating" transformations are inserted at irregular stages into the 
basic procedure, the convergence of the procedure is at least not destroyed. 

The scope of the theorem may be delineated more carefully as follows: Let {T,} be an 
infinite sequence of transformations such that ¢ (T,, (z)) < @(z), and let the sequence {z,} be 
defined by Zn41 = 7, (z,)- Then the sequence {z,} satisfies those hypotheses of the above 
theorem which relate to it if the following condition is met: 

For each m there exists k >m such that 


Ty = Pys Ty. = Por e+e Thynea = Pn- 


If the T's are random selections of the P;'s, this condition is met with probability 1, 
so that one of the alternative methods suggested by Hildreth in section 6 of [1] is seen to con- 
verge with probability 1. Also suggested in [1] is another method, namely: select for mm that 
transformation, P;, which gives the greatest reductian in ¢. The above theorem cannot be 
applied to the sequence so generated. However, the continuity of the transformations P; readily 
guarantee the conclusions of the theorem, 

It remains to be seen whether any sequence {T,,} such that each t.* Pi for some i, 
and such that each P; = T,, for an infinite number of n's yields a sequence {z,} for which the 
conclusions of the above theorem hold. 

Hildreth [1] shows how to reduce general linear constraints to rectangular ones. 
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AN ALGORITHM FOR SOLVING THE TRANSPORTATION PROBLEM 
WHEN THE SHIPPING COST OVER EACH ROUTE IS CONVEX* 


E.M. L. Beale 


Admiralty Research Laboratory, Taddington, Middlesex, England 





An algorithm is presented for solving the transportation problem 
when the shipping cost over each route is a convex function of the 
number of units shipped by this route. 


The algorithm can also be used when the shipping costs are linear, 
whether or not there are capacity restrictions on the number of units 
that may be shipped over each route. But it does not seem to be any 
better than other standard methods for these problems, such as those 
given by Ford and Fulkerson [6], [7]. 











INTRODUCTION 

Many practical applications of linear programming can be formulated as transportation 
problems. Mathematically this means that the problem can be expressed as follows: 

Choose nonnegative integers Xj (i=1,..., m; j=1,....n) such that 


n 
(1) >; xj = a (i=1,...m), 
j=l 


so as to minimize 


m n 


(2) c=)) z Cis Xi - 


i=1 j=1 


(Some of the Ci will be arbitrarily large if the corresponding variables Xij must vanish ina 
meaningful solution.) 

In the terminology of the transportation problem, the i represent sources, the j 
represent destinations, and Xi represents the number of units shipped from the jth source to 
the jm destination. The constraints (1) express the fact that a; units must leave the jth source 
and bj units must arrive at the j® destination. 

Such problems can of course be solved by the general Simplex Method for linear 
programming, see Dantzig [2]. But many special algorithms have been proposed for this type 





*Manuscript received May 26, 1958. 
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of problem, taking advantage of the specially simple nature of the constraints. It might there- 
fore be considered futile to add to the number of such special algorithms, particularly as that 
proposed by Ford and Fulkerson [6] is known to be very efficient. 
The chief merit of the new algorithm is that it becomes only slightly more complicated 
if the objective function C is of the form 
m on 
iia 2 id C45 (45) 5 


i=l j=1 


where ¢;. (x; ;) is a general convex function. Dantzig [4] has pointed out that such problems can 
be solved by existing methods, but the present approach provides a particularly compact 
algorithm. This paper is therefore offered primarily as a contribution to the theory of mini- 
mizing a convex function of variables subject to linear inequalities. 

Problems of this type arise for example if one is shipping to meet an uncertain demand, 
and has a loss for each unit by which the supply falls short of the actual demand. See [1], [3], 
[5]. . 

In the next section of this paper the algorithm is defined, and a proof that it must 
terminate is given. A small scale numerical example is solved by the algorithm in the third 
section. Some further remarks are made in the fourth section. 


A FORMAL DESCRIPTION OF THE ALGORITHM 

The algorithm is an iterative one, and must start from some "feasible" trial solution, 
i.e., a set of nonnegative Xi satisfying the constraints. The later work is likely to be shortened 
if this trial solution is chosen so as to have a reasonably small value of C, but this is not nec- 
essary. In some problems with capacity restrictions on the number of units that may be sent on 
certain routes, it may not even be obvious how to find a solution that does not violate the capac- 
ity restrictions. In this case one may imagine that there is a large finite cost for each unit by 
which the amount sent on any route exceeds capacity, and in the first stage of the algorithm one 
can work only with those costs, ignoring the true cost until a genuine feasible solution has been 
found. 


* 
Now, given our trial solution, Xj = Xj (say), we can compute incremental costs 
associated with a unit increase in the number of units sent by each route, i.e., 


U(i, j) = c,, (x, 


* 
ij Gj + 1D) - Oj OG) 


D(i, i) = ¢; (x;,) - cy; (x; >®. 
These costs will be known as the "up costs" and "down costs" respectively. 

We say that our trial solution is "locally optimal" if there exists no feasible solution 
with Xj - Xi | <1 for all i, j, giving a smaller value of C. Further, we can define a “unit 
cyclic change" in the trial solution, specified by a sequence of q distinct rows ij, eos iy and 
q distinct columns iy, = iy? as a change in which Xi is decreased by 1 unit for i = ins 


j = j. (r=1,...q) and x;, is increased by 1 unit for i=i (r=1,...q), where 


j rei?) = Jy 
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’ * 
i Ps 


eel = i;- It is easy to see that any change in the trial solution to one with IX; - Xj | <1 for 


all i, j] can be thought of as a sum of unit cyclic changes involving different elements Xii- It 


follows that our trial solution must be locally optimal if no unit cyclic change is profitable. 
Further, any change of trial solution can be decomposed into a sum of unit cyclic changes such 
that no xij is increased in some cycles and decreased in others (though it may be altered by 


more than one cycle). It follows that a local optimum must be a global optimum if 


* * ons * 
Cij (x; ;) as Cj (x;,) 2 (x;, = Xj) U(i, j) for Xj 2 Xj ’ 


* * - * 
Cj (x;;) - iy (x;,) > (xi, - Xj) D(i, j) for Xj < Xj - 


These inequalities certainly hold if the Cj (x;,) are convex functions. 


The basic idea of the algorithm, an idea that has been exploited similarly in other 
methods, is that if the minimum up cost is greater than or equal to the maximum down cost in 
each column, then no unit cyclic change is profitable, and hence our present trial solution is 
optimal. In this situation we say that the matrix of up and down costs is in "final form." If 
this is not so, then we add a constant to all U(i, j) and D(i, j) for some particular i. This is 


equivalent to adding k Xij to Ci; (x;,)- It adds 2 k Xj =k a; to the total cost of any feasible 
j 


solution, and therefore does not affect the differences between the costs of different feasible 
solutions. We show how a sequence of such additions can be performed so that either the 
matrix is reduced to final form with the present trial solution, or else an improvement in the 
trial solution becomes apparent, in which case we make the indicated improvement and begin 
again. 


We first seek some destination iy and 2 sources iy and io, such that 
D(i,, 51) > Ulig, 51) - * 


If this is impossible, the matrix is already in final form and the problem is solved. 
Having found i,, j,, and ij, we add D(i,, iy) - Ulig, iy) to all costs associated with 


source ig, i.e., to all D(ig, j) and all U(ig, j). This will make Ulig, i,) = D(i,, i,), and is 


therefore a contribution towards producing a matrix in "final form."" But, using the adjusted 
costs, the total cost is unaffected if we send one unit fewer from source iy to destination iy, 


and one unit more from source ig to destination iy, i.e., if we switch one unit required at 
destination iy from source iy to source ip. Note that, since one cannot send a negative 


number of units from any source to any destination, we must write D(i, j) = -« whenever 


* 
x..=0. Soif D(i,, j,) > U(i,, j,) we must have x. = 0; and hence the switch sugggested 
ij 2 *t 2? *1 i, Jy 


above cannot involve sending a negative number of units along any route. But the switch 





*Note that the symbol 7 means "'strictly greater than,'' and not "greater than or equal to." 
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obviously cannot be made unless we also make one or more other such switches so as to leave 
unaltered the total number of units supplied from each source. We are therefore interested in 
finding some destination, say jg, such that the total cost is reduced if we switch one unit 

required at destination jg from source ip to source i;. We could then satisfy all the require- 


ments at a reduced total cost by making the two indicated switches, (which from a unit cyclic 
change with q = 2). 

Even if we cannot find a destination ig for which such a pair of switches is profitable, 
we may be able to find ig, ig and ig such that (after suitable addition to all costs associated 
with source ig) the total cost is unaffected by switching one unit required at destination jo 
from source ig to source ig, and the total cost is reduced by switching one unit required at 
destination ig from source ig to source ij. We could then satisfy all the requirements at a 


reduced total cost by making 3 switches—the two defined immediately above together with the 
original switch of one unit required at destination iy from source iy to source io, (which 


form a unit cyclic change with q = 3 provided these sources and destinations are distinct). 


This process can obviously be extended to 4 or more simultaneous switches. But it is 
clear that our first task is to find some destination ig such that 


D(ig, jg) > Uli, ig) 


for some i. If this applies when i = ij, so much the better. If not, we will be able to increase 
all the costs associated with source i so as to make U(i, ig) = D(ig, jg), and continue. 


Before a complete description of the algorithm is attempted, we consider an additional 
rule that improves the efficiency of the algorithm slightly. We do not allow jg to be the same 


as jj. For at this stage 


D(i,, i,) = U(ig, jx) 


since c, , (x, ,; ) is a convex function. So that if D(i,, j,) > U(i, j,) for some i, then 
ig jy “ig iy 2° *1 1 


D(i,, j,) > U(i, 3,) ’ 


and the whole procedure is simplified if this i is used instead of, and not as well as, ip in the 
set of switches to be made. 


The complete algorithm will now be described. 
We define a "ladder" whose elements are alternately sources and destinations. 
Initially the ladder is empty, and whenever this is so we seek some destination iy and 


2 sources i, and i, such that D(i,, j,) > U(ig, j,)- If this is impossible, the matrix is already 
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in final form and the problem is solved. In practice, given iy it is desirable to choose iy and 
ip 80 as to maximize D(i,, j,) - U(ig, j,), but this does not affect the theory of the method. 
We then enter i, jy and io in that order at the head of the ladder. 
We now have a source, ig, at the foot of the ladder, and can define a general step to be 
taken whenever a source iy is added to the foot of the ladder. This step follows one of two 


courses: 


(1) If the source ip is not the same as i_ for any q <p, then add 


q 


D(iy_ yy Ip-1) - Ulips ip_y) 


to all D(i,,, j) and all Uli, hy 


Now seek a destination ip’ other than ip-1 (if p > 2), such that D(i,, ip) =a. ip) 


for some i. (In practice it is tentatively recommended that one tries each destination in turn, 


starting with ip-1 +1, or with 1 if ip-1 =n.) 


Three cases must now be distinguished. 
(a) If such a destination can be found, it is entered at the foot of the ladder, followed 
by a source ty41 such that 


(4) Dliy, ip) > Ulingy ip) - 


(Theoretically it does not matter which source satisfying (4) is chosen as ty 41° But it is 
recommended that, if p > 2, one should try ib-1 or if that does not satisfy the required con- 
dition iy-2? and so on back to i;- If none of these is possible, or if p = 1, one should choose 


inal to maximize D(i,, ip) - Ulin ip)-) One can now re-enter the step to be taken when 


there is a source added at the foot of the ladder, with p increased by 1. 


(b) If there is no destination ip satisfying (4) for any ip me 


must be removed from the ladder, and a new destination jp-1 must be sought, i.e., the program 


but p> 2, i, and jy 


is re-entered immediately after the last asterisk with p reduced by 1. (The practice recom- 
mended above involves first trying the same jp-1 with a different iy? and if this fails trying 


the other destinations in turn, proceeding to 1 after n. If this is done, we may speak of remov- 
ing only the source iy from the ladder whenever we can find a replacement using the same 


iy-1 


(c) If there is no destination ip satisfying (4) for any ™ and p = 1, the ladder 


i 

p 

is cleared, and the step defined above for when the ladder is empty must be repeated. 
(2) If however in = iy for some q < p, the trial solution is changed as follows: 


Reduce x, , by 1 andincrease x, 
a i 


a unit cyclic change if j q 


by 1 for r=q,q+1,...,p-1. (This is 
; mis r+lJr 
aaecata ip-1 are distinct.) 





*In practice this addition need not to be made. One can instead have a setof numbers wj repre- 
senting the amount deemed to have been added to all D(i, j) and U (i, j). 
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It is easy to see that, since in = iy we still have a feasible solution, and C is reduced 


by D(i,_y) ip-1) - Uli, ip-1)» if ig Pi ip-1 . But the suggested procedure ensures this, for 


D(i,_1 ip-1) > Uli, jp- » ” Uli, jp- » ’ 


Uli, iq) 2 D(i,, iq) - Uligyy iq) ’ 


so if ig = jp-1 we could and would have chosen iy = tg4l in preference to i,. It will be 


q’ 
=e 

a’q ‘aq 

We now examine the complete cost function C to see if a further change of trial solution 


on the same lines would reduce C still further or not. Having made the best possible change of 
this type, we adjust the values of the U(i, j) and D(i, j) to allow for the new values of some of 
the x" and restart the algorithm from the new trial solution. It is suggested that the additions 


observed that this argument depends on the convexity of the function ci 


that have already been made to the U(i, j), and D(i, j) should be retained since they are likely 
to leave the matrix more nearly in final form than it was originally. It is further suggested that 
when the algorithm is restarted one should use the previous value of jp-1 as the new jj: This 


, and a further reduction in this 


is because our previous step reduced the value of Xj 
p 


-1Jp-1 
element may well be profitable. 
The algorithm is now defined, and the next thing to do is to prove that it must terminate. 
First we note that there are only a finite number of possible feasible solutions with all 
the Xij integral, and the trial solutions used in the algorithm all have a value of C strictly 


smaller than the previous value of C. So the algorithm must terminate unless the procedure 
while we are dealing with one particular trial solution fails to terminate. (Throughout most of 
the computation this procedure is normally very quick: one puts two or three sources into the 
ladder and a better trial solution emerges. It is unfortunately true that when approaching the 
optimal solution one is frequently involved with one or two trial solutions requiring a large 
number of such steps. Further experience is therefore needed before one can be confident that 
the method is satisfactory in practice. But it can be proved that the number of steps is finite, 
and this will now be done.) 

Now, given a source at the foot of the ladder, either this source is already present 
further up the ladder, in which case a cheaper trial solution is already apparent, or else ina 
finite number of steps, either it will be removed from the ladder or another source will have 
been added further down the ladder. It is also clear that the number of sources in the ladder 
cannot exceed the total number of sources without some source being repeated, and a cheaper 
trial solution thereby becoming apparent. It is therefore only necessary to prove that the 


ordered set of sources and destinations in the ladder can never repeat itself for any given 
trial solution, 





*I am indebted to Dr. D. R. Fulkerson for pointing out the necessity for this condition. 
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We first show that this cannot happen without the ladder becoming empty. Let us sup- 
pose the contrary, and that tb is the source furthest down the ladder that has not been removed 


between the successive appearances of some particular set of sources and destinations in the 
ladder. 
Let ip and in4l be the next two elements in the ladder on these occasions. Then the 


first time (4) must have been satisfied, and we must have increased all elements D(i,.4> j) 
)=U(i 


and Ulin j) to make D(i ). After this we cannot have increased D(i 


p’ !p p+l’ Jp p’ Jp 
since ty cannot have appeared again in the ladder or else we would have been able to improve 


our trial solution. Further, we cannot have decreased Ulin yy ip) because elements are never 


decreased in this algorithm. Consequently (4) cannot have been satisfied again and the required 
result is established. : 
We show next that the set of sources in the ladder cannot repeat itself if the ladder 
becomes empty. To do this we define a source s as "inactive" if D(s, j) - U(i, j) < 0 for all i 
and j. Nowa source is only removed from the ladder when it is inactive, except that if p > 2, 


i, may be removed while D(i,, ip-1) > Uli, ip-1) for some i—this complication will be con- 


sidered later. Further, a source can only cease to be inactive if something is added to D(s, j), 
and this can only happen if the source goes back on the ladder again. Whenever the ladder is 
empty all sources that have been on it are inactive. If all other sources are inactive, the 
matrix is in final form and the iteration terminates. If not, we must find some source that has 
not previously been on the ladder to put at the head of the ladder. This completes the proof that 
the iteration must terminate, except for the complication mentioned above. To deal with this, 
note that 


D(i,_y) ip-1) - Ui), ip-) ’ 


since a constant will have been added to all Di(i,, j) and all U(i,, j) to achieve this when ip 
was added to the ladder. Further 


Uli, ip-1) 2 D(i,, ip-1) ’ 
since Cij (x;;) is a convex function. Hence 
D(i,_> ip-1) aa Uli, in-1) 2 D(i,, ip-1) - U(i, ip-1) . 


It follows that even if D(i,, ip- »? U(i, ip- » when ip is removed from the ladder, 


U(i, ip-1) must be increased sufficiently for this not to be so before iy-1 can be removed 


from the ladder. This complication does not arise when p = 1, so our basic conclusion that 
whenever the ladder is empty all sources that have been on it are inactive remains true. 


A NUMERICAL EXAMPLE 
The algorithm will now be illustrated by a small-scale artificial example. 
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Suppose we have 5 units at each of 4 sources, and desire to send 4 units to each of 5 
destinations, and suppose that the costs of shipping 1 unit over any route are as follows: 


4 
3 
4 
1 


1 5 
1 9 
8 3 
2 6 


Suppose further that the cost of shipping a second unit over any route is 2 more than indicated 
above, that the cost of shipping a third unit is 4 more than indicated above, and that it is 
impossible to send more than 3 units over any route. 

To illustrate the method of finding a genuine feasible solution when there are capacity 


restrictions, we start from a very simple first trial solution that in fact violates the capacity 
restrictions, namely 


0 
0 
0 
4 


For each route we now write the value of U(i, j) followed by a stroke and'the value of 
D(i, j). If true costs are temporarily ignored, and we work only in units of the fictitious large 
costs associated with exceeding capacity restrictions, the matrix of up and down costs becomes 


+/1 0/0 0/- 0/- 0/- 
0/- +/0 0/0 O/- 0/- 
0/- O/- 0/0 +/0 0/- 


0/- O/- O/- 0/0 +#+/1. 


+ 
As always, the down costs associated with routes where Xj = 0 are negatively infinite. 
* 
They are here denoted by -. The up costs associated with routes where Xj is equal to or 


greater than capacity are here taken to be positively infinite and denoted by +; though it would 
have been equally reasonable to take them as 1 at this stage, since we are working in units of 
the fictitious large costs associated with exceeding capacity restrictions. 

It is convenient in hand computation to use capital letters for sources and small letters 
for destinations. If we arbitrarily consider putting the first destination a in the ladder as iy, 
we find that sources A and B can be taken as iy and ip, since D(A, a) - U(B, a) =1>0. 
Adding 1 to all D(B, j) and all U(B, j), we find that D(B, b) - U(A, b) =1>0. Hencea 
reduction in the number of units sent over routes (A, a) and (B, b) and an increase in the 
number of units sent over routes (B, a) and (A, b) reduces the objective function, All this can 
be expressed conveniently as follows: 
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w 
i 

Trial Solution Matrix of up and down costs (= Amount to be 

added to each row) 





+/1 0/0 O/- O/- 0/- 0 


0/- +/0 0/0 O/- 0/- 61 


0/- O0/- 0/0 +/0 0/- 


0/- O/- O/- 0/0 +/1 




















We must now consider the complete objective function to decide whether a change of 
more than 1 unit is profitable. Here a change of 1 unit saves only 1 cost unit, and this does 
not apply to a second change since the entry (Aa) is now within capacity. 

We now start again with the new trial solution, having modified the matrix of up and 
down costs accordingly. If, as recommended, we start the ladder with iy = b, we reach the 
following situation. 





Trial Solution Matrix of up and down costs 





+/0 0/0 O/- O/- 0/- 


0/0 0/0 0/0 O/- O0/- 


0/- 0/- 0/0 +/0 0/- 


d 
A 


0/- O/- O/- 0/0 +/1 | @1 




















At this stage we are in trouble, because no D(A, j) exceeds the corresponding U(i, j). 
So Row A must be deleted from the ladder, and so must Column d, since D(D, d) exceeds no 
U(i, d). We are therefore forced to consider the route (De), and we find that D(D,e) = 2 and 
exceeds all U(i,e). Our rule is to go to the row written lowest down on the ladder, because 


this gives the simplest change of trial solution. The whole of this stage of the algorithm is then 
expressed as follows: 
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Trial Solution Matrix of up and down costs 





+/0 0/0 O/- O/- 0/- 


0/0 0/0 0/- 0/- 


0/- 0/- +/0 0/- 


de 
0/- O/- O/- 0/0 +/1 61 Ac 




















A single change now gives a genuine feasible solution. We can therefore start working 
with true costs and abandon the present values of the amounts to be added to each row. 

Note that it would have been quite reasonable to deviate from the recommendation about 
the choice of iy, and to have started this stage of the iteration from some entry containing an 
inadmissably large value in the present trial solution, i.e., (De). This would have shortened 
the number of steps required; but the procedure actually adopted illustrates how the method 
eventually finds out what to do without any outside information. 

Starting from our genuine feasible solution, we have a working sheet as follows: 





Trial Solution Matrix of up and down costs : Ladder 





+/5 5/3 5/- 5/- 4/- D 


3/1 5/3 5/3 9/- 3/- 


8/- 5/- 3/1 +/1 6/4 








2/- 5/- 7/5 8/6 +/5 














We see that a change of one unit in this way reduces C by 1 cost unit. We must now 
consider whether a further change on the same lines would be profitable. The answer is in the 
negative, since we cannot reduce the number of units sent from B to a by more than one with- 
out this number becoming negative. 

We therefore proceed as follows: 
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Trial Solution Matrix of up and down costs Wi Ladder 





+/5 5/3 5/- 5/- 4/- | 0 A 


1/- 5/3 5/3 9/- 5/3 


8/- 5/- 3/1 +/7 6/4 


4/2 5/- 1/5 8/6 5/3 




















A change by more than one unit would involve exceeding the capacity of the route from 
A to b. 





Trial Solution Matrix of up and down costs 





5/3 +/5 5/- 5/- 4/- 


3/1 3/1 5/3 9/- 5/3 


8/- 5/- 3/1 +/7 6/4 


a 
4/2 5/- 7/5 8/6 5/3 B 




















Note that at the beginning of the above stage it was impossible to choose b as iy, so we 
proceeded to the next destination. Note also that the change in the trial solution here involves 
6 routes instead of the usual 4. This time a change by a second unit is possible, but the relevant 
down costs are all reduced by 2 and the relevant up costs are all increased by 2 after a single 
change. Consequently the reduction in C is diminished by 2x 6 = 12 and becomes negative. A 
single change only is therefore indicated and we proceed as follows: 





Trial Solution Matrix of up and down costs : Ladder 





3/1 +/5 5/- 1/5 4/- 


5/3 3/1 3/1 9/- 


8/- 5/- 5/3 


5/- 7/5 
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Trial Solution Matrix of up and down costs 





3/1 +/5 5/- 7/5 4/- 


3/1 3/1 5/3 9/- 5/3 


8/1 5/- 5/3 7/5 6/4 | 45 





2- 0 0+ 6/4 5/- 5/- 8/6 5/3 | 26 

















Again we had to remove a source and a destination from the ladder, but a cheaper trial 
solution became apparent soon afterwards. 





Trial Solution Matrix of up and down costs 





5/3 5/3 5/- 7/5 4/- 


3/1 5/3 3/1 9/- 5/3 
8/- 5/- 5/3 7/5 6/4 


4/2 5/- 7/5 8/6 5/3 




















At the beginning of the above stage we first tried to choose a as iy: This was impos- 
sible so we chose b and found we could choose B as iy and C as in. But C had to be 
removed from the ladder, and the only other possibility, retaining B as ij, was to choose e 
as jy and A as ip. But A then had to be removed from the ladder, and the ladder then 
became empty. There are two ways of satisfying oneself that the matrix is now in final form. 
In hand computation the easiest is to note that Sources A, B and C have all been removed 
from the ladder and so must be inactive. One can easily see that Row D is also inactive. The 
alternative is to scan each column of the matrix of up and down costs in turn, comparing the 
largest D(i, j) with the smallest U(i, j) for each j. 


DISCUSSION 

The algorithm can of course be used when the cost function is in fact linear, i.e., given 
by (2). In that case U(i, j) = Cis and D(i, j) = U(i, j) if x >0 or Dii; j)=-o if Xi =0. We 
therefore need only one binary digit to specify D(i, j), or none if the Xj are recorded with the 
U(i, j). In hand computation one may either record the Xi and the U(i, j) separately, under- 
lining the U(i, j) when Xi # 0, or else record them together. 

Similarly, when the cost function is linear but there are capacity restrictions on the 
number of units that may be shipped over each route, so that Ci (x; .) =); Xj, for 0< Xi, < x 
but ¢;; (x; 5) is arbitrarily large when Xj > Xj one ternary digit is needed in addition to Cj 
to specify U(i, j) and D(i, j). 


iM 
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However there is no evidence that the present algorithm is superior to those given by 
Ford and Fulkerson [6], [7], for these problems. 

A notable feature of the present algorithm is that the concepts of basic and nonbasic 
solutions play no part in it. A good consequence of this is that when solving a transportation 
problem involving a lot of degeneracy, such as the allocation problem where all the a; and b. 
equal unity, one is not held up by having to make changes of basis without changing the trial 
solution at all. But a bad consequence on other types of linear problems is that one may be led 
to consider nonbasic trial solutions in which many of the “ are small. Consequently the 
number of units that can be switched when changing from one trial solution to the next is liable 
to be small—it is obviously equal to the smallest es to be decreased. The number of trial 
solutions that have to be considered before reaching the optimal solution may be correspond- 
ingly large. 

One would like to be able to solve the transportation problem when the Ci (x;;) are 
general, i.e., not necessarily convex, functions. Methods based on Gomory's [8] recent work 
on solving linear programming problems in integers, combined with Markowitz and Manne's [9] 
device for dealing with nonconvex piece-wise linear objective functions, will probably be 
developed in due course. If the present approach is used, the fundamental difficulty is that one 
cannot be sure that a local optimum of such a problem is a global optimum. And even finding a 
local optimum seems to be a difficult task. With the present algorithm, one cannot hope to put 
the matrix of up and down costs in "final form" if D(i, j) > U(i, j) for some route. One could 
try using fictitious values of D(i, j) and U(i, j) that are equal to each other and intermediate 
in value between the true D(i, j) and U(i, j) for any route with D(i, j) > U(i, j). It is clear 
that if an improvement on the present trial solution can be obtained assuming these fictitious 
costs, then it will also be an improvement using the true cost functions. It is also easy to see 
that if the present trial solution is optimal for all sets of such fictitious up and down costs for 
these variables, then it is a true local optimum: indeed only the extremes of the ranges of the 
fictitious costs need be considered, i.e., the fictitious costs may be put equal to either the true 
up cost or the true down cost. But the amount of exploration that one could reasonably do in 
this way might well be very limited, particularly as the local optimum one could hope to find at 
the end of it might not be a global optimum. : 

Some remarks should perhaps be made on the choice of sources and destinations to be 
put into the ladder when alternatives are available. Other things being equal, it is obviously 
desirable to choose ip and in41 so that 


(5) D(i,,j,) - UC 


p’ Jp 


is as large as possible. This will tend to reduce the number of sources and destinations that 

have to be put into the ladder before a better trial solution appears, and also to provide a 

worthwhile improvement on the present trial solution when a change is made. But it is not at 
‘all obvious how far it is worth going to achieve this end. 

It is tentatively suggested in the second section that it is not worth considering all 
possible destinations at each stage, but that esl should be chosen to maximize (5) given ip? 
unless a cheaper trial solution can be found immediately. But if the j, are, so to speak, not 
chosen on merit, it seems very important that each destination should be considered about 
equally frequently as a possible ip Otherwise one may waste time making minor adjustments 
while hopelessly inefficient routes are still being used in the trial solution. In particular, one 
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should not start from destination number one and work upwards each time, though this might 
be the easiest thing to program on a digital computer. It is tentatively suggested in Section 2 
that the destinations should be scanned in order, starting from ip-1° It is felt that this is the 
simplest procedure that may not be too inefficient in practice, but experience on some medium 
or large sized problems is required to confirm this. 

Similarly, it might be worth adding an extra row to the matrix of up and down costs, 
giving Min D(i, j) for each j, together with the value of i concerned. This would obviously 

i 


simplify the task of seeing whether a destination was a possible one for inclusion on the ladder. 
One possibility is to record this quantity only when it is computed in the normal course of the 
iteration, and to delete it when the row i is added to the ladder (thereby causing a change in 
D(i, j)), or when the trial solution is changed. 
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EQUILIBRIUM POINTS IN GAMES WITH VECTOR PAYOFFS 
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Introductory Note 
by Fred D. Rigby, Office of Naval Research 


Historical. The topic of games with vector payoffs is one which 
could be expectedto attract attention onthe basis ofits intrinsic interest. 
However, the history of the particular problem treated in Dr. Shapley's 
paper was not of this kind and may have some interest of its own. 
During an interval when the writer of this note was engaged in opera- 
tions research his group was asked to analyze a combat situation in 
which movement of forces and the inhibition of such movement were 
critical. It turned out to be feasible to represent essential aspects of 
this situation by a game-like model with reasonably well defined courses 
of action corresponding to pure strategies for each side. However, each 
pair of these strategies, one for each player, generated botha time delay 
in the movement and losses to the moving forces not balanced by losses 
to the inhibiting player. That is, the model was a game withvector pay- 
offinthe sense of this paper. Since the values of these delays and losses 
would be realized in a subsequent battle in the area approached by the 
movement in question, it was clear that some sort of exchange ratio 
between attrition and delay must exist. Efforts to obtain estimates of 
such a ratio failed completely. In fact, several rather arbitrary weight- 
ings of delay and attrition were assumed and the resulting numerical 
games solved for the sake of the insight they could provide, but this 
was a rather unsatisfactory expedient. Had the theory presented in this 
paper been available, it would have been possible at least to sharpen the 
questions asked in pursuit of the ''exchange ratio,'' and probably to 
eliminate rationally considerable sets of strategy pairs as unsuitable or 
as representing extreme cases in the weighting. 


In any event, the theory was not at hand and the study could not 
wait on its development. The problem was, however, brought to the 
attention of Dr. Shapley, with this paper as a consequence. 


Nature of the Results. Perhaps a comment on the kind of results 
which are presented is in order sincethere is a sort of fitness to them. 
As was noted above, the occurrence of vector payoff represents a 
failure to resolve some of the questions whose answers are needed in 
order to construct a game model. This being the case, one should not 
be disappointed that the theory does not produce a clear cut, well defined 
solution concept. What it does produce, if the payoff vectors have K + l 
components,is a 2K parameter set of equilibrium points, in comparison 
with a unique solution for a numerical game similarly specified. (The 
fact that numerical game solutions need not be unique hasits counterpart 
among vector game equilibrium points also, of course.) Such an "equi- 
librium set,'' however, need not cover a major part of the joint mixed 
strategy space, even in games with only a few pure strategies for each 
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player. Accordingly, although finding these sets may be extremely 
tedious for a particular game,they may well turn out to be worth having 
in the sense that they limit the domain over which weighting comparisons 
of the payoff vector components need be made. Since Dr. Shapley's 
results concern both ''strong'' and ''weak'' equilibrium points it is 
pertinent to note that this distinction appears "usually" to concern the 
boundaries of the "equilibrium sets.'' That is, one might expect that the 
set of strong equilibrium points omits boundary points which are included 
in the weak case. It should be remarked, however, that this charac- 
terization of the distinction is not precise or universal, and instances of 
its failure can be produced. 


INTRODUCTION 

The payoff of a game sometimes most naturally takes the form of a vector having 
numerical components that represent commodities (such as men, ships, money, etc.) whose 
relative values cannot be ascertained. The utility spaces of the players can therefore be given 
only a partial ordering (representable as the intersection of a finite number of total orderings), 
and the usual notions of solution must be generalized. In this note we define and characterize 
the noncooperative (equilibrium-point) solutions of such vector games.* 

Maximization of vector-valued objective functions has previously been considered in 
certain generalizations of the linear programming problem (see [6] and [7]; also [8]). There 
appears to be no direct link between these references and the present work, beyond the com- 
mon use of the general principle: a vector function is maximal (in some sense) if and only if 
its inner product with some positive (in some sense) vector is maximized. 


DEFINITIONS 


Let R” denote the space of vectors a= (al, ee a”) and define two order relations on 
n 
i 


aQ@b bX, all k, 


k 


> b*, all k, with 


a BD] b 


pK at least once. 


It is an easy matter to verify the following two properties: 
(1) a © 0 —— > aa>od, alla BP 0, 


(2) ap)o <= Ss aa>d, alla@® 0, 


where aa is the inner product )"y aX aX of the vectors a and a. 


The game to be considered has a zero-sum payoff matrix A = (a;,), each aj, € R". It 
is assumed that the players respond linearly to probability mixtures of vectors, so that the 
"mixed" outcome consisting of the vectors 





*We are indebted to Dr. F. D. Rigby of the Office of Naval Research for several suggestions 
relating to this problem. 
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. 3: N 
44, with probability py} v=1,...,N, ZL P,, = 1, 


is entirely equivalent to the "pure" outcome » I 1 Py ay eR", (Compare the "mixture" 


yy 
axioms of [1], [2], or [3].) It is also assumed that the first player wants to increase the com- 
ponents of the vector, and the second player wants to decrease them. Finally, it is assumed 
that neither player has an a priori opinion concerning the relative importance to himself of the 
different components. We shall concentrate on the strongest and weakest preference orderings 
compatible with these hypotheses—namely, () and and their inverses—and omit discussion 
of the numerous intermediate cases. 

Since the goals of the players are as directly opposed as possible, considering the 
incompleteness of their preferences, it is natural to look first at the "noncooperative" types of 


solution. Writing xAy for Li a Xa; ¥j» we define the mixed strategy pair (x*, y*) to bea 


strong equilibrium point (SEP) if and only if the vector v = x* Ay* is simultaneously maximal 
in the set 





(3) {xAy* |x arbitrary} 
and minimal in the set 
(4) G = {x*Ay | y arbitrary}, 


x * 
in the sense of the ordering [>]. That is, there is no strategy x for which xAy- x Ay and 


no strategy y for which x" Ay" x" Ay. Similarly, (x", y) is a weak equilibrium point 
(WEP) if v is maximal in (3) and minimal in (4) in the sense of @). Clearly, every SEP isa 
WEP. 





CHARACTERIZATION OF EQUILIBRIUM POINTS 

We wish to find a characterization of the SEP and WEP just defined, in terms of 
ordinary equilibrium-point theory [5]. First let us introduce a set of nonnegative "weighting 
factors" a@ = (a}, pose a”), and consider the resulting numerical game aA = (a aj). If x’, y 


* * 
are optimal mixed strategies in this game, andif v= x Ay , then av is its value, and we have 


(5) a(xAy™ -v) <0, allx, 


(6) a(v-x Ay) <0, ally. 


Now, if a 0, then xAy™ - v is never ©) 0, by (1) or (2); likewise v - x Ay @ 0. Thus, 
* * * * 
(x , y ) isa WEP. In similar fashion, if a © 0, then (x , y ) is a SEP. 
This shows that equilibria of both types always exist, and provives a simple way of 
finding some of them—namely, by solving certain zero-sum numerical games of the form aA. 


However, to find them all we must assign different weighting factors, a = (a, ones a’), 
B = (a1, ..., 6"), to the two players’ payoffs. This leads to a non-zero-sum numerical game, 





60 L. S. SHAPLEY 


described by the pair of matrices [a A, - 8 A]. This game, of course, has equilibrium points 


of the ordinary kind—namely, pairs (x", y) such that 
(7) x” (a A) y x(a A)y" » aie, 


(8) x (BA)y <x (BA)y, ally. 


From these, expressions similar to (5) and (6) can be derived, and we see as before that 


(x", y') is a WEP of the vector game if a [5] 0, 6 [5] 0, andis aSEP if a © 0,80 0. 

It remains to show that all WEP and SEP of the vector game A can be obtained from 
the games [a@A, - £8 A] by the above method. (Our original approach is, of course, included, 
via the special case a =.) Let (x*, y*) be a WEP of A, let v = x* Ay*, and let F and G be 


the convex sets in R" defined by (3) and (4). Let ¢( © , F) be the extension of F obtained by 
including all vectors b such that a © b for some ae F. The extended set is still convex, and 
has the same maximal vectors (in the sense of G)) as F. Hence, v is maximal in *( ©, F), 
and lies in its boundary. Consider any hyperplane that supports *( © , F) at v; if a@ is its 
(outward-pointing) normal, then we have , 


(9) a(v-a)>0, all aee(O,F). 

In particular, we have a(v - a) > 0 whenever v@) a, since the set of vectors so dominated by v 
is open and contained in °( G@) , F). This implies that a 0, by (2). A similar construction 
determines af 0 such that 


(10) B(v-b) <0, all be®( ©,G). 


But F is contained in *( ©), F); hence (9) with (3) reduces to (7). Similarly (10) and (4) give 


* - 
(8). This shows that our WEP (x , y ) is an equilibrium point of the numerical general-sum 
game [a@A, - BA], with a BG] 0, 6 GB) 0. 
The corresponding proof for SEP is more complicated. Let (x", y’) be a SEP of A and 


construct the extended set *( , F). Let C” be the r-dimensional set obtained by intersecting 
a( , F) with all supporting hyperplanes at v. Since °( , F) is closed and polyhedral,” in 
addition to being convex, it follows that Cc” is the closed boundary face of lowest dimension 
containing v, and that v is in the (relative) interior of C’. We can finda supporting hyper- 
plane whose intersection with °( , F) is precisely c (any interior member of the set of all 
supporting hyperplanes at v will do). Then, if a is its normal, we have 


a(v-a)>0, all ae*(G),F), 


with equality only if aeC’. 





*It is essential here that © ( , F) be polyhedral (see Chapter 1 of [4], especially pp. 6-8). In 
fact, a supporting hyperplane with strictly positive normal does not exist everywhere, if (say) 
F is a sphere. 
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We claim that a (v - a) > 0 whenever v a. If not, there would be a vector a€ al 
such that v a. But, since v is in the relative interior of Ce. there would also be a vector 
b (of the form b=(1+¢e)v-¢a,« >0) in c’ such that b v. This contradicts the 
maximality of v. Thus, we conclude by (1) that a G) 0. We can now proceed as in the previous 
proof. We sum up: 


THEOREM: The WEP of the zero-sum vector game A are precisely the equilibrium 
points of the general-sum, numerical games [a A, - 8 A] with a 0,8] 0. The SEP of A 
are precisely the equilibrium points of the games [a A, - 8 A] with a©@ 0, © 0. 


EXTENSIONS TO INFINITE AND NON-ZERO-SUM GAMES 

In vector games with infinitely many strategies the characterization of WEP is exactly 
analogous; but the characterization of SEP is more complex, since the sets F, G are not 
necessarily polyhedral (see the footnote). 


A similar result holds for non-zero-sum vector games. In this case it is not even 
necessary that the payoffs of the two players be vectors in the same space. Thus, the WEP of 
[A, B], where ay i ae bi R", are precisely the equilibrium points of [a A, 8B], where 


a€R™ Be R", and @ 0, B 0, etc. The arguments in favor of the noncooperative type of 
solution, however, are less compelling for these games than before, in the zero-sum case. 
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A TERMINATING ITERATIVE ALGORITHM FOR SOLVING CERTAIN GAMES 
AND RELATED SETS OF LINEAR EQUATIONS 


R. B. Braithwaite* 


University of Cambridge 
Cambridge, England 


1. It was conjectured by G. W. Brown [2], and proved by Julia Robinson [7], that a partic- 
ular iterative method of successive "fictitious plays" of any two-person zero-sum finite game 
will determine the value of the game and each player's optimal strategies as limits of con- 
vergent sequences. (The method is described in [5], Chap. 4.) The iteration is effected by 
each player, simultaneously or alternately, always choosing a pure strategy which would be 
best for him to use against the accumulated mixed strategy of his opponent up to that point, 
i.e., assuming, before each play, that his opponent will reply to it with the mixed strategy 
equivalent to the combination of plays he has made in the past. Approximations to the value 
and to the optimal strategies can be found by this method, which can be applied to all two- 
person zero-sum finite games. 

This paper will explain, and justify the use of, an iterative algorithm, discovered by the 
author while experimenting with the Brown-Robinson method, which, though applicable only to 
certain special classes of games, will give their exact values and optimal strategies after a 
predeterminate number of steps. Described in terms of fictitious plays, the iteration is 
effected by the players playing alternately, the player playing second always choosing, as in the 
Brown-Robinson method, a pure strategy best against his opponent's accumulated mixed strat- 
egy, while the first player always chooses a pure strategy best against the last strategy used 
by his opponent. That is to say, the second player always acts on the assumption that his 
opponent will continue to play with the frequency distribution of pure strategies which he has 
exhibited in the past, the first player on the assumption that his opponent will repeat his last 
play. For the classes of games concerned the iteration will repeat itself in cycles of pre- 
determinate length, the end of each cycle being recognizable as a stage at which the first 
player's accumulated mixed strategy yields the same outcome for all strategies of the second 
player. Consequently a "stop rule" can be included in the algorithm so as to terminate the 
iteration at the end of the first cycle, and this cycle will determine both the first player's 
optimal strategy and the value of the game. The second player's optimal strategy can be 
determined by another similarly terminating iteration with the players' principles of play 
exchanged. 

Games to which the algorithm can be proved to apply are completely mixed games with 
certain diagonal properties, similar to those investigated by H. F. Bohnenblust, S. Karlin and 
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L. S. Shapley [1, ££ 15-19]. Their results, deducing complete mixture from certain diagonal 
properties, are slightly improved in £7, and the games concerned are shown in £8 to be 
algorithmically soluble. £9 applies the algorithm to find least common multiples. 

Since solutions of the completely mixed games to which the algorithm applies are 
equivalent to solutions of certain sets of linear equations, but its application is more general 
in this latter case, the algorithm will be defined (§3) and its use justified (£4) in respect to 
solution of linear equations. In terms of fictitious plays, each step (after the first) of the 
iteration as defined in §3 will be equivalent to the conflation of a first-player play with the 
preceding second-player play; but Theory-of-Games considerations will not appear explicitly 
until £6 is reached. 


2. Let A denote the » xn square array whose elements are aij» with row index i and 
column index j. Let A denote the determinant of A, and let Ai denote the co-factor of aij 


in A, Let P=L LU Ajj: (All summations will be over n terms unless otherwise indicated.) 
e 3 = 


We shall be concerned with a set of n + 1 simultaneous linear equations in the n + 1 
variables X41, Xo, +++ X),Z 


Bast, L ajj%j = 2 for every j 


(which will be called the x-set associated with A) and a set of n +1 simultaneous linear 


equations in the n + 1 variables Yy> Vor +++ Vv Z 
= Z for every i 


(which will be called the y-set associated with A). Solutions for the x's of an x-set, or for 


the y's of a y-set, are invariant with respect to any linear transformation T: aye aij +f of 


the elements aij of A; solutions for the z of either set undergo the transformation T. 


A solution with X4, Xo, +++ X, > 0 of an x-set, or with Yy> Yor +++ Vy” 0 of a y-set, 


will be called an all-positive solution. By Cramer's rule, a necessary and sufficient condition 
for an x-set or y-set to have an all-positive solution is: for an x-set, that 2X A;, should be 


J 
either positive for every i or negative for every i; for a y-set, that 2 Ai; should be either 
i 


positive for every j or negative for every j. An x-set with an all-positive solution has the 


unique solution X; = po A,,/T for every i, z=A/l; a y-set with an all-positive solution has 
j 


the unique solution Yj = L A,,/T for every j, z= A/T. 
i 


3. Description of the iterative algorithm 
Regard the jth row of the array A as being an n-fold row-vector rj. 
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Construct a sequence of n-fold vectors recursively as follows: 
¥, is to be any row-vector of A; 


Ym 4 4 is to be constructed by the vector addition to v,, of a row-vector rj of A 


th 


whose row index i is such that the i~ component of Vm is a minimum of the components of 


Ym 


The sequence of cumulative-row-vectors Vy» .-- will be said to terminate at vy 


Vo» 





if v, is the first vector occurring in the sequence having all its components equal to some 


number v. 
If the sequence terminates at ‘VK > Yq will be equal, by its method of construction, to 


the vector-sum 1 k; f;, with each k; a non-negative integer and 1 kj =k. Since all the 
i ia i 
components of 1 Kj rj are equal tov, x; = k; /k for every i, z = v/k will be the unique solution 
i = 


of the x-set associated with A. 


We will prove that, under one or other of two sets of conditions upon the elements of 
A, the sequence of cumulative-row-vectors will always terminate in a predetermined number 


of steps, and hence that the iterative algorithm will provide the solution of the x-set associated 
with A. 


4. Calla vector w = {w,, Wo, -+- Wnt} with XL wi= 1 and Wy, Wo, --- Wy > 0 an n-fold 
i 


weight-vector, and call it all-positive if Wy, Wo, +--+ Wy, > 0. 


THEOREM I: If an nxn array A is such that 


(1) all its elements are integers, 
(2) its associated x-set of equations has an all-positive solution, 
(3) there is a weight-vector w = {¥y Wo) +++ Wn} such that a._< 2) w, a.. for every 


re j j "ri 
r, Ss, r + s (i.e., no nondiagonal element is greater than the w-weighted mean of its 
row, for a column weighting the same for each row), 
then every cumulative-row-vector sequence Vy, Vo, --- constructed out of A by the iterative 


algorithm will terminate at its term, where k = ['/H, H being + the greatest common 
divisor of z Ai} i= 1, 2,...n, the positive or negative sign being taken according as I is 
J 


positive or negative. 


PROOF: Let b; denote mm A,,/H, and call b,, which is an integer and positive by (2), 
J 


the appropriate number for the row-vector r, of A. 2 b; =k. A cumulative-row-vector 
i 
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which is b; r; (i.e., the sum of each row-vector taken its appropriate number of times) 
i 


has as its sth component 1 b; a, = es Ai; a; ,/H = A/H; so all its components are equal. 
7 i ij 4% 


Since the greatest common divisor of by, bo, nee b> k is 1, there is no set of positive 


integers C,, Co, --- C, with L ci < k and ,/2 c= b,/k for every r, and consequently no 
i 


cumulative-row-vector with equal components terminating the sequence before its xh term 
is reached. Hence the sequence terminates if z b; r; is a term in the sequence, in which 
i 


case the sequence terminates at this term. We will prove that 2 b; rj always occurs in a 


i 
sequence constructed by the algorithm. 


The proof consists in showing that if, in forming a cumulative-row-vector vy» of the 
sequence a row-vector f, has been used its appropriate number of times whereas at least 


one row-vector has been used less than, and no row-vector has been used more than, its 
appropriate number of times, then r, will not be the row-vector added at the next step to vp 


to form vy44. A consequence of this will be that no row-vector can be used more than its 
appropriate number of times in forming a term of the sequence until after every row-vector 


has been used its appropriate number of times in forming the term 2° b; rj Of the sequence. 
i 


It will follow that, since the sequence of cumulative-row-vectors is constructed by adding one 
and only one row-vector at each step, this term must be reached at the th step in the iteration. 
To prove that r, will not be the row-vector added to vy to form vp,, , itis sufficient 


to show that the rth component %, of v S} will never be a minimum component of Vp: 
with 0 <c; < b; foreveryizr, 
= 2 bh a - é ; 4, With O< d; < b; foreveryi:zr, 
=A/H- L 
Lie? 


Now consider the value V of the w-weighted mean of the components of Vg- 
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Hence v.-V= 2 d. (=. wa. - a...) > 0. 

* i,itr *\j 4 ir 

If %, > Vv, v, cannot be a minimum component of Vp- 


If v, = Vv, Vv, can only be a minimum component of vy if all the components of v fp are 


equal, which is impossible, since at least one of the row-vectors has been used in the forma- 
tion of vp less than its appropriate number of times. 


Hence in neither case can the cumulative-row-vector sequence be continued by the 
addition of r, to vy, and Theorem I is proved. 


THEOREM I: If an nxn array A satisfies conditions (1) and (2) of Theorem I and 


also the condition 


(3") a.<a 


ij i, [i] +1 for every i, j,izj, 


where [j] denotes j modulo n (i.e., the elements in each row increase to the right up to the 
diagonal element, the right hand element in each row being treated as the immediate prede- 
cessor of the left hand element), then every cumulative-row-vector sequence constructed out 
of A by the iterative algorithm will terminate at its s” term, k being specified as in 

m 


Theorem I. 


PROOF: As in the proof of Theorem I, it is sufficient to show that the on component 
% of the cumulative-row-vector vg, used in this proof will never be a minimum component 


of Vg - 


As before, 
0< d; <b; for every i+r 
with 


0<d, for some itr. 


| ai (rj+1° 


“oe ‘ielo2"  F.2 S Melo - ay) - 


But a [r]+1 > ap for every i+r, and d; > 0 for some i+ r; so Vv, > Vir] si 


Therefore, since ¥. is not a minimum component of vy, the cumulative-row-vector sequence 


cannot be continued by the addition of f, to v [x and Theorem II is proved. 


5. The iterative algorithm has been defined as using rows of the array A to construct 


a cumulative-row-vector sequence. It will in future be called the row algorithm. If the 
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similar column algorithm is specified to use columns instead of rows to form a cumulative- 
column-vector sequence, this sequence, if it terminates, will provide an all-positive solution 
of the y-set associated with A. Sufficient conditions for the termination of such a sequence 


would be given by Theorems I' and II', obtained from Theorems I and II by substituting y for 
x and exchanging column for row throughout. 

Since solutions for the x's of an x-set, or for the y's of a y-set, are invariant with 
respect to any linear transformation of the elements of A, condition (1) in the hypothesis of 


each of the four theorems may be weakened to 


(1') there are constants e and f such that 2.3 = ean, + f with Ke an integer for 


every r, Ss. 


In consequence of this, an iterative row or column algorithm with maximum substituted for 
minimum can be proved to yield terminating sequences by two theorems obtained by exchanging 
<and> in Theorems I and I'. (This exchange would make no difference to the contents of 
Theorems II and I'.) 


6. Completely mixed games 





Game will be used in the restricted sense of a two-person zero-sum finite game 
represented in normal form by an 1 x m array G, with elements Si} in which one player (L) 


chooses an L-strategy consisting of an l-fold weight-vector p = {P45 Po» «++ Py} and the other 


player (M) chooses independently an M-strategy consisting of an m-fold weight-vector 
q = {q,, Go, +++ Gm} this pair of independent choices resulting in the transfer by M to L of 


1 m 
he ye P; a ij outcome units. J. von Neumann (see [6], £ 17.6) proved the fundamental 
i=1 j=1 
existence theorem to the effect that, for every such game, there is a non-empty class \ of L- 


strategies and a non-empty class » of M-strategies which are such that, for some number g 
and 


1 
( for every Ser. * & P; 8ij > 
i=1 


m 


5 
for every q €u, ) qj Bij < g for i 
XY Pi 

j=1 





Strategies belonging to \ or to m are called optimal; an ordered pair (p,q) of optimal 


strategies, one from each class, is called a solution of the game. The number g, which is 
unique, is called the value of the game. 
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If (p,q) is a solution and g the value of the game represented by an 1 x m array 
G6 whose elements are ij (q,p) is a solution and - g the value of the game represented 


by the mx1 array -GT whose elements are - Si 


A game is said to be completely mixed (or a game with "all strategies active") if all 
its optimal strategies, for both players, are all-positive weight-vectors. I. Kaplansky ([3], * 2) 
proved that every completely mixed game is square (i.e., represented by a square array), and 
established results from which it immediately follows that a necessary and sufficient condition 
for the game represented by the nxn array A to be completely mixed is that, in the language 





of £2, both the x-set and the y-set of equations associated with A should have all-positive 


solutions. If this game is completely mixed, it has the unique solution (p,q), where 


A,,/T for every i, qa, = 4 A,,/T for every j, the game having value g = A/T. 
i 


j 
Since Z Pj aij = g for every j, and 2: qj aij = g for every i, the solution has the property 
i j os 


of being a simple solution (in the sense of [8], £4). 
If the game represented by A is completely mixed with solution (p,q), so also is 





every game represented by an array whose elements are linear transformations of elements of 
A. The game represented by A’, with elements ay i> is completely mixed with solution 


(q,p); it has the same value as that of the game represented by A. 


7. Sufficient criteria for a game represented by A to be completely mixed, which do not 
require calculating the cofactors of elements of A, have been established by Bohnenblust, 


Karlin and Shapley ([1], 615-19) for games with certain special diagonal properties. Their 
conditions can be slightly weakened; and the following two theorems are extensions of their 
results. 


THEOREM Il: If a game is represented by an nxn array A which is such that there 
is a weight-vector w = {W1, Wo, --- w,,} such that either 


(1) Sun * Le Wj ay for every i, r, s, rs (i.e., every nondiagonal element is less 
j 
than the w-weighted mean of every row, for a column weighting the same for each 


row) 
(2) Son < L Wi ij for every j, r, Ss, r+ (i.e., every nondiagonal element is less 
i 


than the w-weighted mean of every column, for a row weighting the same for each 


column), 
then the game is completely mixed with a value g not less than min am Wj aij or than 
. 2 


min Wi yj respectively, and A also satisfies the alternative condition ((2) or (1) respectively). 
ji 
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It is convenient to use as a lemma a result established incidentally by Bohnenblust, 
Karlin and Shapley ([1], § 16), whose proof will be followed. 


LEMMA: A square game whose value is greater than every nondiagonal element in its 
array is completely mixed. 


Let g be the value of the game. a 


ij < g for every i, j, i+ j. 


Suppose, if possible, that an optimal L-strategy p contains a zero component, say 


p. = 0. Then 2 p, a, = 2 —_— <¢ 2 p.g=2 p.g=g. But 2° p. a. < gis 
al ; "=~ hise “" Atee . [= 


impossible if p is optimal. Hence every optimal L-strategy is all-positive. 


Now suppose, if possible, that an optimal M-strategy contains a zero component, say 
qg, = 0. By a similar argument Lu q; ag) < 8 which requires ([6], § 17.9.1) that the component 
J 


P. of every optimal L-strategy P should be zero, which is impossible. Hence every optimal 


M-strategy must also be all-positive, and the game is completely mixed. 


PROOF OF THEOREM III WITH CONDITION (2): Let w = {W,, Wo» +++ w,} be a 


weight-vector such that a, < min )° Wi aij for every r, S,r+s. Regarding w as an L- 
j i 
strategy shows ([6], £17.5.2) that min 2° Ww; aij <g. Hence a,, < g for every r, S, r+ §; 
;4 
and, by the lemma, the game is completely mixed. 


Now let q_ be the optimal M-strategy. Then qj aij = g for every i, and 
j 


ag < min - Wj aij for the weight-vector w = q; so condition (1) is satisfied. 
- 3 


PROOF OF THEOREM III WITH CONDITION (1): If A satisfies condition (1) at 


will satisfy condition (2), and vice versa; so the previous argument will prove that the game 

represented by a is completely mixed, has the value g > min j Wj a 
ij 

condition (1). Hence the game represented by A, with the same value g, is completely mixed 


ij? and satisfies 


and satisfies condition (2). 

[The result proved by Bohnenblust, Karlin and Shapley ({1], £16) is the special case of 
Theorem III (without the last clause of the conclusion, which does not hold for the special case) 
in which the weight-vector w of the hypothesis is {1/n, 1/n, ... 1/n} so that the w-weighted 


means are arithmetic means. | 


THEOREM IV: If a game is represented by an nxn array A which is such that 


(1) aij < aj (j]+1 for every i,j,i+tj, 


(2) aij > a for every i, j, j + [i]+1 


i]+1, j 
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(where the notation [j] has the same meaning as in Theorem I), then the game is completely 
mixed. 


PROOF: Let g be the value of the game. 
Suppose, if possible, that an optimal L-strategy p contains a zero component, say 


Py = 0. Then 


x pa, . a <—) 2 ; 
7 Pi i, [t]+1 “Vs , Pi i, [t]+1 7 a Vs a * 5 ats 
Hence L Pi a [t]+1 > g, which requires that the component Nt] +1 of every optimal M- 


strategy q should be zero. But, if WUe}+1 = 0, 


a: = ba ; ; 
. 4; *(t] +1, j ji, jx [t] 2 4 t]+1, j : 


4 j= a ay <E- 


qe 
j,j+ [t]+1 
Hence 1 q; a . <g, which requires that the component p of p should be zero. 
5G Mel+4,3 [t]+1 


Repetitions of this argument show that from Py = 0 there follows that Dp; = 0 for every i, 


which is impossible. Hence every optimal L-strategy must be all-positive. 
A similar argument, starting from the supposition that Ve] aa” 0, will show that any 


optimal M-strategy q can contain a zero component only if all its components are zero, which 


is impossible. So every optimal M-strategy must also be all-positive, and the game is com- 
pletely mixed. 

[This proof follows that of Bohnenblust, Karlin and Shapley ([1], £17) for their result 
which is equivalent to the special case of Theorem IV in which every element in the j_ 
diagonal (i.e., the diagonal sloping downwards from ay;) is less than every element in the 


({i] + 1) diagonal, for j = 2, 3, ...n.] 


8. SOLUTION OF CERTAIN GAMES BY THE ITERATIVE ALGORITHMS 
The iterative algorithms will now be used to provide solutions and values for games 
satisfying the sufficient criteria for complete mixture given in Theorems III andIV. The row 
algorithm, by providing the solution of the x-set associated with the array A representing the 


game, will give the optimal L-strategy p = {P}, Po» «-- P,} and the value g of the game; the 


column algorithm, by providing the solution of the y-set, will give the optimal M-strategy 
q = {qy, Go, +++ q,} and also the value g. Pj = } A,,/7 will be the proportionate number of 
J 


th 


times the i~ row-vector is used in forming the cumulative-row-vector which terminates a 


Sequence constructed by the row algorithm, for every i; 4; = do Aij /T will be the proportionate 
i 


number of times the ~ column-vector is used in forming the cumulative-column-vector which 
terminates a sequence constructed by the column algorithm, for every j; g = A/T will, in each 
case, be-v/k, where v is the value of the equal components of the terminating cumulative 
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vector Vy occurring at the «th step in the iteration. (Unless the greatest common divisor of 


L Ajj, i = 1, 2,...n, is equal to the greatest common divisor of L Ajj, j=1,2,...n,v 
j 1 


and k will be different in the two cases, although v/k will be the same.) The solution 
(p,q) and value g of the game will thus be determined, by a double use of an iterative 


algorithm, without any calculation of co-factors, i.e., without inverting the matrix A. 


THEOREM V: If an nxn array A whose elements are all integers is such that there 


is a weight-vector w = {Ww , Wo, --- W,,} such that either 


(i) i. * L Wj ij for every i, r,s, rss, 
J 


(ii) a, < LU Ww; aj; 


ij for every j, r, S, rs, 
i 


then (a) every cumulative-row-vector sequence constructed out of A by the row algorithm 


will, by terminating, provide the unique optimal L-strategy and the value of the 
completely mixed game represented by A (or, what is equivalent, the all-positive 


solution of its associated x-set of equations), 
every cumulative-column-vector sequence constructed out of A by the column 


algorithm will, by terminating, provide the unique optimal M-strategy and the value 
of the game represented by A (or, what is equivalent, the all-positive solution of 


its associated y-set of equations). 


PROOF: The alternative conditions (i) and (ii) are equivalent, by Theorem II. (i) implies 
condition (3) of Theorem I, (ii) implies condition (3) of Theorem I' (specified in £5). Condition 
(2) of either of these theorems is a consequence of the game being completely mixed, established 
by Theorem III for a game whose array satisfies either (i) or (ii). So all the conditions in the 
hypotheses of both Theorems I and I' are satisfied, and the conclusions (a), (b) of Theorem V 
follow from those of Theorems I, I' respectively. 


THEOREM VI: The conclusions (a), (b) of Theorem V follow from the hypothesis that 
the game is represented by an n xn array A whose elements are all integers and which is 
such that 

(i) ay; < ai [j)+1 for every i, j, i+ j, 

(ii) aij > fi)41,j for every i, j,j+[i]+1. 


PROOF: Condition (i) is condition (3') of Theorem II, condition (ii) is condition (3') of 
Theorem II' (specified in © 5) when the rows and columns of A are re-indexed in reverse 
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order. Condition (2) of either of these theorems is a consequence of the game being com- 
pletely mixed, established by Theorem IV for a game whose array satisfies (i) and (ii). So all 
the conditions in the hypotheses of both Theorems II and II' are satisfied, and the conclusions 
(a), (b) of Theorem VI follow from those of Theorems I], II' respectively. 

The hypotheses of Theorems V and VI may be weakened (as was done in *5 for earlier 
theorems) so as to allow the elements of A to be linear transformations of integers. This will 


permit a new theorem to be obtained from Theorem V by substituting maximum for minimum 
in the algorithms and exchanging < and> throughout. 


9. Use of the algorithms to compute the least common multiple 
Suppose that A is an nxn diagonal array (i.e., a square array whose non-diagonal 








elements are all zero) with diagonal elements Ay, ao, --- a, all positive integers. Since A 
satisfies the conditions of Theorem V, the diagonal game represented by A is completely 


mixed; * and, since A is symmetric, the game's unique solution (p,q) has p = q, and 


this, together with the value of the game, may be determined by a single use of either the row 
-or the column algorithm. A cumulative-vector sequence constructed by either algorithm will 
terminate at its term vy having all its components equal to v, where v = 4/H (in the 


notation of ££2-4). For the case of A, A is ay ap --- a, and H is the greatest common 


divisor of A/a, A/ao — d/a,. Hence ([4], £194, Ex. IV) v is the least common multiple of 


ay, A, +++ aDe The algorithm applied to A will yield this least common multiple in 


(1/a, + 1/ao +eeet 1/a,)v steps, each step consisting in the addition of the corresponding 


element to one minimum component of a cumulative vector. The effectiveness of this very 
simple iteration for yielding the least common multiple v may be proved, without the use of 
Theorem V, by the consideration that a component attaining the value v in a cumulative vector 
of the sequence will be a maximum component, and thus its corresponding element cannot be 
added at the next step in the iteration until after all the components of a cumulative vector have 
become equal to v. 


10. Although inversion of a matrix is avoided in the Theorems V and VI which justify use of 
the iterative algorithms for solving certain sets of linear equations, an iteration concerned 
may be very long and an estimation of its length would require estimating all the co-factors. 
So it is unlikely that the algorithms would be of use for practical computation. Their interest 
would seem rather to lie in relation to the logic of probability, since they give the mixing 
probabilities for optimal strategies in the games concerned as proportions of moves in finite 
(though fictitious) classes of moves, classes which recur cyclically if the fictitious plays are 
continued past the terminating stage on the same principles as before. 





*Kaplansky ([3], $4c) proved that a diagonal game is completely mixed if and only if the 
diagonal elements of its array are either all positive or all negative. 
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NOTE ON SOLVING LINEAR PROGRAMS IN INTEGERS* 


G. B. Dantzig! 
The RAND Corporation 





A recent result of Gomory (Princeton) solved an outstanding prob- 
lem, namely that of solving linear programs in integers. Gomory showed 
how to add linear inequality constraints to a linear programming problem 
automatically in such a way that the extreme points of the resulting 
convex contain only integral solutions in the neighborhood of the mini- 
mum. In this paper an alternative method is given for generating these 
additional constraints in a way easy to justify and to apply. However it 
is not known whether these conditions will lead to a solution in a finite 
number of iterations of the simplex method as is true for the stronger 
Gomory conditions. Thus anyone considering their practical use should 
weigh the ease of generation against the extra number of iterations that 
will undoubtedly be required for convergence. 











A recent result of Gomory, "Essentials of an Algorithm for Integer Solutions to Linear 
Programs" [communicated to Bull. Amer. Math. Society in letter from Princeton dated April 
23, 1958], solved an outstanding problem, namely that of solving linear programs in integers. 
As usual, once the principle is clear it is easy to develop variants. This paper is one of these. 

The main problem considered by Gomory is that of finding a way of automatically adding 
linear inequality constraints to a linear programming problem so that the extreme points of the 
resulting convex contain only integral solutions or more specifically integral solutions that 
minimize a linear form. The word automatically has been underscored because the essential 
problem is not one of existence of such linear constraints, which is obvious, but one of 
generating constraints in an easy, practical way. 

The procedure given here is not the same as Gomory's for adding the additional 
constraints; indeed it is known to be weaker. It is worth noting because it is easy to see why 
the relation holds and because the relation has a simple form. Gomory has shown that his 
stronger conditions will lead to a solution in a finite rumber of iterations. It is not known 
whether the same is true for the weaker conditions generated in this paper. 


THEOREM 1: If a linear program problem in variables Xy, XQ, +++) X, has a basic 


feasible solution for basic variables X) Xq +++, X 


m 29; which is inadmissible for any reason, 


then the partial sum condition 





*The RAND Corporation Report P-1359, copyrighted 1958. 
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is satisfied for all admissible solutions with integral values and not the basic solution. 


PROOF: It is obvious that the basic solution does not satisfy the constraint, since 
x; = 0 for all j = m + 1, m + 2,...,m. Moreover there exists no other feasible solution 


with these particular nonbasic variables all zero since they determine uniquely the values of 
the remaining dependent variables X14» Xqy +++) Xp and this solution was inadmissible. It 


follows that for any admissible solution that the partial sum 


Xml * *m42 *°°° + 


must be positive. If the x; are integral this means that the sum must be at least one. Q.E.D. 


In general suppose the convex of solutions in the n-dimensional space of X4, Xq, +++, X, 
is defined by k linear inequalities 


n 


n(x) = )- ny x; > my sere 
j=1 


where nij and n; are positive and negative integers. An extreme point of the convex would be 


defined by the intersection of some n of the hyperplanes 1; (x) =n, for i = ij, ip,...,i,. If 
the extreme point is inadmissible for any reason then at least one of the conditions 7; (x) =n, 





must be violated for an admissible integral solution, hence for at least one i = iy, ig, wees iw 


T(x) >n, + 1 because nij and n; are integers and therefore 


EG 21+2 a, i = ij, in, ---, i 


n 


is a linear inequality not satisfied by the extreme point but satisfied by all admissible integral 
solutions. 

From a computational point of view, where n is not too large it is easier to assume 
some arbitrary objective form and to solve the dual linear programming problem because the 
additional constraints take the form of adding new columns to the matrix which can be auto- 
matically generated by forming sums of the columns in the basis except for the component in 
the cost row whose sum is increased by unity. In order not to tax the memory of a computer, 
only the original columns, the newly generated column, and the columns in the basis need be 
recorded; that is to say, earlier generated inequalities which correspond to columns not in the 
optimum basis can be dropped since they are slack at the attained extreme point. 





PROBLEMS 

It has been suggested that this journal might serve as a medium for publishing problems 
in the area of Logistics — with the idea that other persons might be interested in those prob- 
lems and might submit comments. Readers are invited to submit brief statements on applied 
and theoretical problems in Logistics. Address letters to Managing Editor, Naval Research 
Logistics Quarterly, Office of Naval Research, Washington 25, D. C. 


The following paper has been received: 
A DIFFICULTY IN LINEAR PROGRAMMING 
FOR TRANSPORTATION PROBLEMS 


Marvin Luther and John E. Walsh* 


Operations Research Division, 
Lockheed Aircraft Corporation 


ABSTRACT 





Linear programming has many worthwhile applications to transpor- 
tation problems of a relatively uncomplicated nature. For more com- 
plicated problems, however,a realistic linear- programming formulation 
can require such an excessive number of variables that obtaining a 
solution is not feasible. This paper shows that an excessive number of 
variables is usually required when different types of items are to be 
transported subject to time restrictions andon carriers that can simul- 
taneously handle several items. 











INTRODUCTION AND RESULTS 

A type of transportation problem that is of only a moderately general nature can be 
defined as follows: A number of items of I different types are to be transported in a system 
that contains L different locations, each of which is an initial pickup, an ultimate destination 
point, or both for at least one item. Each item has a specified time limit within which it must 
reach its ultimate destination, and the number of different time deadlines considered is D. 


*Now with the Operations Research Group of the System Development Corporation. 
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Only one type of carrier is considered, which can carry P or fewer payload units between any 
two locations (fractions of units are not considered). The items are standardized so that the 
quantity of items that results in p payload units for a carrier is . The system is such that 
only T different departure times for the carriers need to be considered. 

Given the various "costs" associated with transportation, this somewhat restricted 
problem can be placed in a linear-programming framework. The goal would be to transport 
the items to their ultimate destinations within the time restrictions and at minimum "cost." 
In principle, this is a straightforward application of linear programming. However, it is not a 
feasible application if an excessive number of variables are required for placing the problem 
in a linear-programming form. 

As is shown in the section under "Derivations," the number of variables required for a 
general solution of this problem by the standard linear-programming approach is 


Pp /Q,+ (L-1)ID-1 


V=L(L-1)T 


% 


As indicated by the following air-cargo example, the value of V is huge, even for simplified 
situations. 


AIR-CARGO EXAMPLE 

For the air-cargo situation considered, the carrier is a cargo aircraft, the locations 
are air-cargo bases, and a basic trip is a flight between two bases. The items are packages, 
item types correspond to densities of packages, and the conversion between numbers of pack- 
ages and units of airplane payload can be made on the basis of weight. The numerical problem 
consists of the determination of the number of variables required for the linear programming 
of a simplified example of this kind of air-cargo system. 

The system examined consists of only five bases. The time period considered is an 
entire day, and it is assumed that the use of four-hour increments is a sufficiently fine time 
subdivision for scheduling the departures of cargo aircraft. Division of the day into six-hour 
increments is assumed to be adequate with respect to deadline times for package deliveries. 
The cargo aircraft can carry a maximum payload of 50,000 lb, and units of 10,000 lb are 
assumed to furnish an adequate subdivision for payload. Packages are assumed to weigh 2000 
lb each (a great simplification) so that Q, = 5p. Finally, only three different densities are 
assumed to exist for the packages so that I= 3. (Density is a consideration in the "costs" 
associated with loading, unloading, etc.) 

In summary, for the air-cargo example considered, L= 5, T= 6, D= 4, P= 5, 

= 5p, I= 3. By substituting these values into the expression for V, it is found that 
6.3 x 1018 different variables are required for the linear programming of this simplified 
air-cargo example. This number is far in excess of the capabilities for any presently avail- 
able, or anticipated, method for solving linear-programming problems. 


DERIVATIONS 

The procedure for putting this transportation problem into a linear-programming form 
consists in using individual carrier trips between two specified locations as the fundamental 
consideration. The linear-programming variables consist of all the different kinds of basic 





NOTES 


trips that can occur in the system. A basic trip is identified by the location it leaves, its 
destination, the time it leaves, the number of payload units carried, and the trip character- 
istics of the items carried (numbers, types, ultimate destinations, deadline times). 

Before specifying the variables needed for representing the possible kinds of basic 
trips, some additional notation is introduced: 


symbols for locations (u # v = 1, ..., L) 


time when carrier leaves u(t, i ee 


number of different kinds of trip characteristics that can be associated with an 
item (all possible combinations of the values for ultimate destination, deadline 
time, and type of item) 


(L - 1) ID 
qx, number of items with the k-th kind of trip characteristic (k = 1, ..., C) 


Then the variables required for specifying the different basic trips between any two locations 
are 


x (u, V, ty P, qy preey Gc)» 
where the q's are such that 


Cc 


2, %K= Qp- 


k=1 


The problem is to determine V, the number of different x's. 

The combination u,v, can be selected in L(L - 1) ways, while ty has T possible 
values and p has the allowable values 0, 1,..., P. For given p, the number of possibilities 
for the q's equals the number of different ways of placing Q5 identical balls into C different 
boxes. This enumeration has the value* 


Qnp*+C-1 
2p 


A multiplication of these numbers of possibilities and a summation of the resulting expression 
over the allowable values for p yields 


Pp 
L(L-1)T bm 
p=0 


the number of different x's, which verifies the expression used for V. 


Q,+Cc-1 


2p 





*W.A. Whitworth, Choice and Chance, 5th edition, G. E. Stechert and Co.; New York, 1942, p. 90. 








NOTES 


Remark: At first glance, the number of different x's might appear to be 


L(L - 1) T (PC + 1), 


since P represents the nonempty payload-unit possibilities for a carrier and C represents 
the number of different kinds of trip characteristics for items. (The 1 in PC + 1 accounts 
for routings of empty carriers.) However, this viewpoint does not allow for the combinations 
of trip characteristics that can occur with respect to the loads that can be handled by a car- 
rier. The actual number of different variables can be enormously greater than the value of 
L(L - 1) T (PC + 1). For the air-cargo example, the value of V is 6.3 x 1018 while the 
value of L(L - 1) T (PC + 1) is only 2.9 x 104. 





NEWS AND MEMORANDA 


Readers are invited to submit to the Managing Editor items of general interest in the 


field of logistics 


Saul I. Gass, formerly Applied Science Representative for IBM, has recently been appointed 


Chief, Operations Research Branch, Corporation for Economic and Industrial Research, 


Arlington, Virginia. 


Notice of Meetings: 


Operations Research Society of America; Seventh Annual Meeting; May 14, 15, 1959; The 
Shoreham Hotel, Washington, D. C. 


Sessions are being planned on: 


2 


2 
3. 
4. 
b) 
6 


Measures of Effectiveness 
New Techniques and Applications 
Operations Research in Government (non-military) 


Operations Research in the Social Sciences 


. Applications of Operations Research to the Management of Research 
Others to be decided. 




















RECENT PUBLICATIONS 


EDITOR'S NOTE: Listing of a publication in this section is for record and reference only and 
does not constitute an endorsement of point of view or advocacy of use. 


STUDIES IN THE ECONOMICS OF TRANSPORTATION. By Beckman, McGuire, and Winsten, 
Yale University Press, New Haven, 1956, xix 232 pages. 

Both highway traffic and railroad transportation systems are treated in terms of models, 
or, to paraphrase a sentence from the introduction, of simplified conceptual counterparts of 
the systems studied. The construction and analysis of these models are carried through with 
notable care and lucidity; not only are the variables and parameters identified and their rela- 
tionships plainly set forth, but discussion is provided which makes amply clear the scope and 
limitations, the nature of the reasoning involved, and the results obtained in each phase. The 
more mathematical parts of the analysis are set apart in separate sections which can be by- 
passed without sacrifice of continuity. A substantial bibliography is provided. 


A COMPREHENSIVE BIBLIOGRAPHY ON OPERATIONS RESEARCH. Compiled by the 
Operations Research Group, Case Institute of Technology. John Wiley and Sons, Inc., New York, 
1958, xi + 188 pages. $6.50. 

To write a fair and accurate appraisal of so catholic a collection must be nearly as difficult 
as the original job of compilation. To completely evaluate the volume at hand might require its 
extensive use in connection with a variety of projects, plus an encyclopaedic knowledge of the 
body of literature which it ostensibly indexes. Since this review is being written with these two 
prerequisites at best partially fulfilled, the reader is asked to take the following observations 
as suggestive, rather than conclusive. 

The Bibliography lists by author and classifies over three thousand titles published through 
December 1956; a supplement covering the better-known publications for 1957 is appended. A 
unique classification scheme permits economy of space; each entry is assigned a ten-digit “tag”. 
The first digit gives “general classification” (survey and review, technical and theoretical, 
methodological etc. — 10 entries); the second and third indicate the “type of organization involved” 
(industry group, military branch, financial etc. - 52 entries); the fourth and fifth are concerned 


with “function or type of activity” (management, production, logistics etc. — 25 entries); the 


sixth and seventh describe “techniques” (inventory theory, queuing, statistics etc. - 20 entries); 


the eighth and ninth indicate “aspects of research” (measures of effectiveness, decision theory, 
value theory etc. — 20 entries); and the last two classify “aspects of practice” (ethics, personnel 


Selection, philosophical and conceptual etc. - 11 entries). Research is no doubt greatly expedited 
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by the accompanying set of punched cards, available separately for about $30.00, and an elec- 


tric sorter. Scanning the appropriate columns for pertinent entries through 175 pages could be 
laborious. 


To properly evaluate the classification system, it seems necessary to use the Bibliography 


in connection with a familiar topic. However, it can be observed that certain conflicts are bound 
to arise; in the “techniques” category, for instance, papers classified under “inventory theory” 
may be of fundamental interest to someone studying dynamic programming or servomechanism 
theory. But this cannot be ascertained from the classification number. 

A section of the book is devoted to “special bibliographies,” which lists references by topic; 
this would seem to be of little help, since there are only 36 categories, and those categories with 
large lists were deliberately rejected in interests of space. Thus there is no bibliography of 
linear or dynamic programming, or production scheduling. There seems to be some doubt as to 
the completeness of these special bibliographies (Terborgh’s Dynamic Equipment Policy, which 
is included in the author file, does not appear in the section on “replacement and renewal” ); and 


further doubt as to the appropriateness of at least one of the categories arises in-noting that 





renewal theory references are grouped with equipment replacement policy references. 
Perhaps everyone who has worked or dabbled in operations research (or more accurately, 
the fields from which operations research as it is constituted draws its techniques and tools) 
has his own ideas as to which works have contributed most to the advancement of this body of 
knowledge. It would be difficult for the reviewer to conceive of any list of such works which 
excluded these titles, all conspicuously missing from the volume at hand: 
1. and 2. A. Wald: Statistical Decision Functions, and Sequential Analysis; these are 


pioneering works in modern statistical decision theory. 





P. Masse: Les reserves et la régulation de l’avenir dans la vie economique; this 





work, published in 1946, is one of the first mathematically interesting contribu- 
tions to inventory theory, and is basic to the large body of analysis of hydro- 
electric operation and water control. 


. W. Feller: Introduction to Probability Theory and its Applications; this is a 





standard reference work and text; further, it contains interesting material on 
queuing problems. 
H. H. Goode and R. W. Machol: Systems Engineering; an introduction to a wide 


range of tools and problems usually identified with operations research. 





. E. F. Beckenbach (ed.) Modern Mathematics for the Engineer; this collection 
contains papers by recognized authorities on some of the mathematical techniques 
of interest in operations research, as well as a paper on operations research. 

And others, listed without comment: U. Grenander and M. Rosenblatt: Statistical Analysis 
of Stationary Time Series, L. J. Savage: The Foundations of Statistics, W. Shewhart: Economic 
Control of Quality of Manufactured Product, Philip Wolfe: The Simplex Method for Quadratic 
Programming (RAND Paper P-1205, 25 October, 1957), E. B. Wilson: An Introduction to 


Scientific Research, A. W. Tucker: Game Theory and Linear Programming, (notes available 
































RECENT PUBLICATIONS 85 


from the Oklahoma State University bookstore, Stillwater); R. Dorfman, P. A. Samuelson, and 
R. M. Solow: Linear Programming and Economic Analysis, R. G. D. Allen: Mathematical 





Economics, and W. Leontief: The Structure of the American Economy. 





It may be argued that these works were omitted on the ground that they are books about 
economics, or about statistics, or mathematics, or engineering. The same is as true of such 
listed works as E. F. Beach: Economic Models, or R. R. Bush and F. Mosteller: Stochastic 
Models for Learning, or T. C. Fry: Probability and its Engineering Uses. It is true that there 











was a considerable problem of selection; however, with operations research such an ill-defined 
term, the problem could certainly have been foreseen, and since the Bibliography set out to list 
almost all works of interest to model builders and decision makers (as was necessary to obtain 
anything of interest), it is hard to understand the above omissions; overlook obscure articles, 
yes, but not the foundations of statistical decision theory (Wald) or the original work in input- 
output analysis (Leontief). 
There are many fine general and special bibliographies in such books as R. D. Luce and 
H. Raiffa: Games and Decisions, T. M. Whitin: The Theory of Inventory Management (second 








edition), I. D. Bross: Design for Decision, the previously mentioned works of Goode-Machol 





and Dorfman-Samuelson-Solow, and Churchman-Ackoff-Arnoff: Introduction to Operations 





Research. No indication is found in the volume at hand of the existence of these helpful supple- 
mentary reference lists; an asterisk could have sufficed to single out those works included in 
the Bibliography which themselves contained unusual or extensive bibliographies. 


Concluding item: Stephen Potter’s Theory and Practice of Gamesmanship is waggishly 





indexed 8-00-00-14-00-9; it is a book about “other aspects of practice” or “game theory.” 


Princeton University (Reviewed by Daniel Orr) 


MODERN COMPUTING METHODS. Philosphical Library, New York, 1958, 129 pages. $8.75. 

A neat little book on computing methods based on a series of lectures given at the Imperial 
College of Science and Technology, England. 

With the increased availability of high-speed digital computers there has been an increased 
interest in numerical techniques. This book gives a brief look at a large number of such 
techniques and almost everyone is sure to find something new here. 

The fields covered are: linear equations and matrices; roots of polynomial equations; 
latent roots of matrices; finite-difference methods; ordinary differential equations; hyperbolic, 
elliptic, and parabolic partial differential equations; relaxation methods; tabulation of mathe- 
matical functions; and computation of mathematical functions. 

Also included are: a large bibliography; a discussion of the English computer DEUCE; 
and a discussion of the mechanical differential analyser. 


(Reviewed by Jean Campbell) 
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NAUTILUS 90 NORTH. By Commander William R. Anderson, USN, with Clay Blair, Jr., 
The World Publishing Company, 238 pages plus index and photographs. 

Because a purely scientific account of the preparations for the Nautilus’ trip under the ice 
pack of the North Pole could hardly make the best seller list, the author has slanted this 
account toward the nonscientific reader. However, the backbone of the book is the account of 
the research necessary for the successful completion of this trip. 

Commander Anderson gives a highly readable account of the procurement and on-ship 
testing of the Mark 19 gyrocompass and the N6A inertial navigator, the upward-scanning 
fathometer or ice detector, and the procurement of maps and charts of hitherto uncharted 
regions. His account describes the measures which were to be taken should the Nautilus 
become incapacitated while under the ice pack. 

Information on the North Pole region has been scarce and conflicting for this has been a 
truly unexplored region. Thus, the observations and data concerning this region in Nautilus 
90 North make a valuable contribution to the scientific literature. 


(Reviewed by Jean Campbell) 


CONCEPTS OF NAVY LOGISTICS ORGANIZATION. 

Part I - “The Diamond of Logistics” (Special report reprinted from the May 1958 BuSandA 
Newsletter) by Captain E. D. Stanley, Jr., SC, USN. 

Part II - “The Club of Transportation” (Special report reprinted from the September 1958 
BuSandA Newsletter) by Captain E. D. Stanley, Jr., SC, USN. 

(NOTE: Copies of the above special reports may be obtained by writing to: BuSandA, 

Code N1, Room 0211, Main Navy Building, Navy Department, Washington 25, D. C.) 





The above special reports are both interesting and informative and are significant contri- 
butions to the broad study of logistics; they should be of particular interest to persons concerned 


with logistics management and administration. Part I deals primarily with what is termed 


“Material Logistics.” The author has chosen a two-dimensional figure, the diamond, to 
represent the following aspects and categories of the subject: 


(1) producer and consumer aspects; 

(2) technical and supply functions; 

(3) levels of supply; and 

(4) commodity management. 

Part II is concerned with transportation, one of the key elements of logistics. The author 

has chosen the figure of a club to represent the following elements of the subject: 

(1) shipping and movement control; 

(2) traffic management; 

(3) terminal operations; and 


(4) transport operations. 
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KAFFEE IM GEMEINSAMEN MARKT. EINE NICHT-LINEARE PROGRAMMANALYSE. 
By Freidrich Wendt, Hamburgisches Welt-Wirtschafts-Archiv, Hamburg, 1958. (In German). 
This interesting study, published by The Hamburg Archives of World Economy, recently 


came to the attention of the Quarterly. The consequences of the Common Market on the import 


of single goods is investigated, with coffee being used as an example. 
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